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Abstract. We investigate a continuous time, probability measure valued dy- 
namical system that describes the process of mutation-selection balance in a 
context where the population is infinite, there may be infinitely many loci, 
and there are weak assumptions on selective costs. Our model arises when we 
incorporate very general recombination mechanisms into a previous model of 
mutation and selection from Steinsaltz, Evans and Wachter (2005) and take 
the relative strength of mutation and selection to be sufficiently small. The 
resulting dynamical system is a flow of measures on the space of loci. Each 
such measure is the intensity measure of a Poisson random measure on the 
space of loci: the points of a realization of the random measure record the 
set of loci at which the genotype of a uniformly chosen individual differs from 
a reference wild type due to an accumulation of ancestral mutations. Our 
main motivation for working in such a general setting is to provide a basis for 
understanding mutation-driven changes in age-specific demographic schedules 
that arise from the complex interaction of many genes, and hence to develop 
a framework for understanding the evolution of aging. 
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CHAPTER 1 



Introduction 



A principal goal of population genetics is to understand how the mechanisms of 
mutation, selection and recombination determine the manner in which the distribu- 
tion of genotypes in a population changes over time. The distribution of genotypes 
dictates, to a large degree, the distribution of phenotypes, an object that is both 
more readily observable and of more immediate practical interest. Conversely, the 
differential survival of genotypes is mediated almost exclusively by their phenotypic 
expression. 

More specifically, mutation accumulation models that incorporate the counter- 
vailing forces of recurring, slightly delete rious m utations and persistent selection 



have been staples of evolutionary theory BOO . In order to use these models to 
explain phenomena such as aging that are presumed to result from the combination 
of many small mutational impacts, it is necessary to adopt a multilocus perspec- 
tive. However, as Pletcher and Curtsinger [PC98 point out, early progress in this 
area has relied on simplifying assumptions that are severely limiting and possi- 
bly unfounded: equal impacts from all mutations, additive effects of mutations on 
age-specific survival, and the existence of mutations that impact a specific, narrow 
range of ages. 

Several general multilocus formalisms have been proposed. Notable among 
these is that of [KJB02 , which is designed to allow almost any conceivable regime 
of selection, mating, linkage, mutation, and phenotypic effects. Such frameworks 
have not been exploited for studies of aging, in part because so much detail is 
counterproductive when what concerns us is not the fate of any individual allele 
but rather the mass of overlapping, age-varying phenotypic effects that are central 
to standard theories of aging. 

K. Dawson Daw99| applied a variant of the Kimura-Maruyama rare-allele 
approximation [KM66, BOO] (see also |Kon82| ) to aging. While this less detailed 
view of the genome is more amenable to theoretical analysis, it is also not suited 
to describing the interacting phenotypic contribution of multiple loci. 

In a previous paper [SEW05J, we proposed a model, which we also describe 



briefly here in the development leading to (1.4), that overcame many of these limita- 



tions. That model leads to computable solutions for mutation-selection equilibria, 
the hazard functions which such equilibria imply, and the time evolution of the 
population distribution of genotypes. 

In this work, we define and analyze a parallel model that, in essence, incorpo- 
rates recombination into the model of [SEW05J. The key assumptions behind the 
model in this work are: 

• the population is infinite, 

• the genome may consist of infinitely many or even a continuum of loci, 

• reproduction is sexual, in that each individual has two parents, 
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• the mechanism of genetic recombination randomly shuffles together the 
genomes of the parents in order to obtain the genome of the offspring, 

• mating is random and individuals are identical except for their genotypes, 
so the population at any time can be completely described by a prob- 
ability measure on the space of possible genotypes — heuristically, the 
empirical distribution of genotypes in the population or, equivalently, the 
probability distribution of the genotype of a randomly sampled individual, 

• individuals are haploid — an individual has only one copy of each gene 
rather than copies from each of two parents, 

• down a lineage, mutant alleles only accumulate — there is no back- 
mutation to cancel out the alleles introduced by earlier mutation events, 

• fitness is calculated for individuals rather than for mating pairs, 

• a genotype becomes less fit when it accumulates additional mutant alleles, 
but otherwise selective costs are arbitrary, 

• recombination acts on a faster time scale than mutation or selection — 
the common quasi-linkage equilibrium (QLE) assumption. 

Our general picture of the genome and the processes of mutation, selection and 
recombination is similar to that of |BT91l IKJB02j . Whereas |BT91L IKJB02] 
invoke the QLE assumption to justify treating the effects of alleles at different loci 
as nearly independent, we present a detailed asymptotic treatment in a standard, 
explicit scaling regime. 

More specifically, we establish conditions under which a discrete-time process 
with the above features converges to a continuous-time, deterministic, dynamical 
system that has its state space the probability measures on the set of possible 
genotypes. At any point in time, the genotype distribution is in the complete linkage 
equilibrium represented by the distribution of a Poisson point process. The points 
of a realization of the Poisson process represent loci at which a randomly sampled 
individual's genotype has accumulated ancestral mutations away from an original 
"reference" genotype which we will call the wild type. Because a Poisson process 
is defined by its intensity measure, which in our case is a finite measure on the 
set of loci, the asymptotic model can be more simply described by a deterministic 
dynamical system that moves about in the space of such measures. 

In order to establish such a convergence result it is, of course, not enough simply 
to show that the distribution of the genotype of a randomly chosen individual at 
a fixed time converges to the distribution of a Poisson random measure. Rather, 
we must keep track of the accumulated perturbations that arise over time from the 
effect of mutation, selection and recombination, and demonstrate the convergence 
of the entire time evolution of the genotype distribution to a dynamical system of 
Poisson distributions. 

Moreover, we establish conditions under which the continuous-time dynamical 
system has equilibria, investigate when the system converges to an equilibrium from 
the pure wild type genotype, and obtain results about the stability and attractivity 
of that particular equilibrium when it is present. 

Our models provide a basis for the rigorous study of a number of questions in 
the biodemography of longevity, as surveyed in [Wac03 : 

• the adequacy of Charlesworth's |Cha01] proposed explanation of Gom- 
pertz hazards and mortality plateaus as a consequence of mutation accu- 
mulation, 
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• the causes of the hyperexponcntial hazards described by Horiuchi [H or03] , 

• extensions to age-structured settings of Haldane's principle that, in its 
original form, equates the mutation rate to population decline of fitness, 

• contrasts between proportional and additive mutation effects on hazards 
as discussed in |Bau05L IBau08j , 

• the possibility of mortality rates diverging to infinity at ages before the 
end of reproduction, which have been termed "Walls of Death" . 

We pursue these matters in the companion papers |WES08| IWSE08] . 

1.1. Informal description of the limit model 

In this section we describe the asymptotic model and motivate why it is rea- 
sonable that such a dynamical system should arise as the limit of a sequence of 
discrete-time systems with the features listed above. 

Denote by M. the collection of loci in the portion of the genome that is of 
interest to us. There is a distinguished reference wild type genotype, and each 
locus represents a "position" at which the genotype of an individual may differ 
from that of the wild genotype. We allow the set M. to be quite general and do 
not necessarily think of it as a finite collection of physical DNA base positions or a 
finite collection of genes. For example, the proposed explanation for the Gompertz 
mortality curve and mortality plateaus at extreme ages in Charlesworth [ChaOl 
suggest taking M. to be a class of functions from R + to R + : the value of such a 
function at time t > represents an additional increment to the mortality hazard 
rate at age t conferred by a mutation away from the wild type at this locus. Some 
structure on M. is necessary to accommodate rigorous probability theory, so we 
take M to be a complete, separable metric space. 

The genotype of an individual is specified by the set of loci at which there has 
been a mutation somewhere along the ancestral lineage leading to that individual. 
More precisely, a genotype is an element of the space Q of integer-valued finite 
Borel measures on M.. An element of Q is a finite sum S m . , where S m is the unit 
point mass at the locus m e M . The measure J2 i S m . corresponds to a genotype 
that has ancestral mutations at loci mi,m2, ■ • ■■ The wild type genotype is thus 
the null measure. We do not require that the loci m, £ M. be distinct. We thus 
allow several copies of a mutation. This is reasonable, since we are not identifying 
mutations with changes in nucleotide sequences in a one-to-one manner. 

For example, if M. is finite, so we might as well take Ai — {1, 2, . . . , N} for some 
positive integer N, then Q is essentially the Cartesian product of N copies of 
the nonnegative integers: A genotype is of the form X^Li n i^ii indicating that an 
ancestral mutation is present rij times at locus j, and we identify such a genotype 
with the nonnegative integer vector (rii, ri2, ■ ■ ■ , njv). 

Recall that the population is infinite and all that matters about an individual 
is the individual's genotype, so that the dynamics of the population are described 
by the proportions of individuals with genotypes that belong to the various subsets 
of Q. We are thus led to consider a family of probability measures P t , t > 0, on 
G, where Pt(G) for some subset G C Q represents the proportion of individuals 
in the population at time t that have genotypes belonging to G. Note that Pt(G) 
also may be thought of as the probability that an individual chosen uniformly at 
random from the population will have genotype belonging to the set G. In other 
words, P t is the distribution of a random finite integer- valued measure on M. For 
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example, if = {1,2,..., N} and we identify Q with the Cartesian product 
as above, then Pt({(ni, n 2 , ■ ■ ■ , ^jv)}) represents the probability that an individual 
chosen uniformly at random from the population will have rij ancestral mutations 
at locus j for j = 1, 2, . . . , N. 

We next indicate how we model mutation, selection and recombination to obtain 
the evolution dynamics for Pt, t > 0. 

Mutation alone. Suppose that there is only mutation and no selection or recom- 
bination. In this case all individuals present in the population at a give time die 
and reproduce at the same rate because differing genotypes do not confer differing 
selective costs and mutations accumulate down lineages because they cannot be 
replaced by recombination. 

We describe the mutation process using a finite measure v on the space of loci 
Ai, where v(B) for B C M. gives the rate at which mutations from the ancestral 
wild type belonging to the set B accumulate along a given lineage. 

Write P t $ = J g $(g) dP t (g) for some test function $ : Q -> R. That is, P t $ 
is the expected value of the real-valued random variable obtained by applying the 
function (f> to the genotype of an individual chosen uniformly at random from the 
population. The content of our assumptions is that when there is only mutation 
Pt, t > 0, evolves according to 

(1.1) j t P t <S> = P t (J [$(■ + 5 m ) - $(■)] v{dm) 

For example, when M. = {1,2,..., N} we have the system of ordinary differ- 
ential equations 

, N 
-P t ({n}) = £ [P t ({n - ej}) - P t ({n})] , 

where ej is the j th coordinate vector. This equation is, of cou rse, ju st a special 
case of the usual equation (see, for example, Section III. 1.2 of BOO] ) describing 



evolution due to mutation of type frequencies in a population where the set of types 
is and mutation from type n to type n + ej occurs at rate v({j}). 



The evolution equation (1.1 1 is of the form 
for a certain linear operator A. We recognize that A is the the infinitesimal gener- 



ator of a ^-valued Levy process, and hence (1.1 1 has the following explicit proba- 
bilistic solution. Let II denote a Poisson random measure onMx M. + with intensity 
measure v ® A, where A is Lebesgue measure; that is, II is a random integer-valued 
Borel measure such that: 

(1) The random variable 11(A) is Poisson with expectation (y® A) (A) for any 
Borel subset A of M x E + . 

(2) If Ai, A2, . . . , A n are disjoint Borel subsets of M. x K+, then the random 
variables II(Afe) are independent. 

Define a C/-valued random variable Z t (that is, Z t is a random finite integer-valued 
measure on AI) by 

Zt := I 5 m dIL((m,u)). 
JMx[0,t] 
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Then, 

P t $ = E [$(W + Z t )] , 

where W is a random measure on M. that has distribution Pq and is independent of 
II. In particular, if Po is itself the distribution of a Poisson random measure, then 
P t will also be the distribution of a Poisson random measure. If we write p t for 
the intensity measure associated with P t (that is, p t is the measure on M. defined 
by pt(M) :— Jg g(M) dP t (g) for M C A4), then p t evolves according to the simple 
dynamics 

p t (M) = p {M) + tu{M). 



Selection alone. Now suppose there is only selection and no mutation or recom- 
bination. We specify the fitnesses of different genotypes by a selective cost function 
S : G -> R+. The difference S(g') - S(g") for g',g" e Q is the difference in 
the rate of sub-population growth between the sub-population of individuals with 
genotype g" and the sub-population of individuals with genotype g'. We make the 
normalizing assumption 5(0) = and suppose that 



(1.2) 



S(g + h)>S(h), g,heG, 



in line with our assumption above that genotypes with more accumulated mutations 
are less fit. 

It follows that at time t > the per individual rate of increase of the proportion 
of the population of individuals with genotype g' is P t S — S(g'). More formally, 

d 



(1.3) 



dt 



P t $ = - P t {$ ■ [5 - P t S]) 



[ <%') \s{g')- [ S{g")dP t {g") 



iq L Jg 
For example, when A4 = {1,2,..., N} we have 



dPt(g'). 



d_ 
dt 



P t (W}) = - 



5(n')-VP t ({n"})5(n") 



^({n'})- 



If 5 is non-epistatic, that is, 5 has the additive property 



then the selective effects of different mutations don't interact. In particular, if Po is 
the distribution of a Poisson random measure, then P t will also be the distribution 
of a Poisson random measure and, writing p t for the intensity measure associated 
with P t as before, we have 



pt(dm') = po(dmf) 



S{8 m „)p a {dm") 



M 



p s (dm') ds. 



However, if 5 is epistatic (that is, non-additive), then P t will, in general, not be 
the distribution of a Poisson random measure — even when P is. 



Combining mutation and selection. If there is mutation and selection, but no 
recombination, then the appropriate evolution equation for P t comes from simply 



6 



1. INTRODUCTION 



combining equations ( |1.1| ) and (1.3): 

d 

(1.4) 

- P t ($-[S-P t S\). 



dt P t $ = P t (J [*(■ + 5 m ) - $(•)] v{dm) 



This is the model introduced and analyzed at length in [SEW05 using the Feynman- 
Kac formula. When A4 = {1, 2, . . . , N}, ( 1.4 1 is a special case of the classical system 
of ordinary differentia l equa tions for mutation and selection in continuous-time - 



see Section III. 1.2 of [BOO for the derivation of an analytic solution that agrees 



with the one that arises from a Feynman-Kac analysis. 



Recombination alone. The effect of recombination is to choose an individual 
uniformly at random from the population at some rate and replace the individual's 
genotype g' with a genotype of the form </(• n M) + g"(- n M c ), where g" is the 
genotype of another randomly chosen individual, M is a subset of M. chosen ac- 
cording to a suitable random mechanism, and M c is the complement of M. Thus, 
recombination randomly shuffles together two different genotypes drawn from the 
population. In order to specify the recombination mechanism fully, we would need 
to specify the recombination rate and the distribution of the segregating set M. 
Suppose, however, that we are in the following regime: 

• recombination acts alone (that is, there is no mutation or selection), 

• the initial population Pq has the property that there does not exist an 
m € M. with Po({g € Q ■ g{{m}) > 0}) > (that is, no single mutation 
from the ancestral wild type is possessed by a positive proportion of the 
initial population) , 

• the mechanism for choosing the segregating set M is such that, loosely 
speaking, if to' and m" are two loci then there is positive probability that 
to' e M and to" S M c (that is, no region of the genome M. is immune 
from the shuffling effect of recombination) . 

Then, under these condition the probability measure Pt will converge as t — > oo to 
the distribution of a Poisson random measure on M. with the same intensity measure 
as P . Moreover, the speed of this convergence increases with the recombination 
rate and so if we take the the recombination rate to be effectively infinite, then 
the distribution P t will be essentially Poisson for all i > with the same intensity 
measure as Pq, irrespective of the details of the recombination mechanism. 



Combining mutation, selection and recombination. We have seen that if 
Pq is the distribution of a Poisson random measure, then mutation preserves this 
property. On the other hand epistatic selection drives the population distribution 
away from Poisson, while increasing rates of recombination push it towards Poisson. 
Thus, when all three processes operate and we consider a limiting regime where 
recombination acts on a much faster time scale than selection and recombination, 
we expect asymptotically that if the initial condition Pq is the distribution of a 
Poisson random measure on M. , then P t will also be the distribution of a Poisson 
random measure for all t > 0. Let pt denote the intensity measure of P t (so that 
p t is a finite measure on the space A4 of loci). If we write X n for a Poisson 
random measure on M. with intensity measure 7r, then we expect from combining 
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the observations above that pt should satisfy the evolution equation 



(1.5) p t (dm) = p (dm) + tv(dm) - E [S(X Ps + S m ) — S(X Ps )] p s (dm) ds 



We define the rigorous counterpart of (1.5 1 in Chapter [2] and establish the 
existence and uniqueness of solutions. Furthermore, we show in Chapter [4] that our 
dynamical equation is indeed a limit of a sequence of standard discrete generation, 
mutation-selection-recombination models . 

Given that it involves computing an expected value for a quite general Poisson 



process, equation ( 1.5 1 may look rather forbidding. However, for certain reasonable 
choices of selective costs the infinite sum can be evaluated explicitly, leading to a 
simpler and more intuitive looking system. We will consider three such cases in the 
following three sections. First, though, we make a useful observation about how 



loci may be "clumped together" in (1.5 1 



Remark 1.1. Consider two instances (p' t )t>o and (p")t>o of the dynamical 



system (1.51 with respective locus spaces M! and M", associated genotype spaces 
Q' and Q" , mutation intensity measures v' and v" , and selective costs S' and S" . 
Suppose that there is a Borel measurable map T from M.' onto M." such that: 

• the initial measure p ' on J\A" is the push- forward of the initial measure 
p' on X' by the map T, 

• the mutation intensity measure v" on M." is the push-forward of the 
mutation intensity measure v' on M! by the map T, 

• the selective cost S' on Q' has the property that S'(g') = S'(h') whenever 
the push-forwards of g' , b! E Q' by T are the same, 

• the selective cost S"(g") on Q" is given by the common value of S'(g') for 
all g' € Q' that have push-forward by T equal to g". 

Then, for each t > 0, the measure p" on Ai" is the push-forward of the measure 
p' t on M! by the map T. Intuitively, we have a situation in which for a given for 
g" € G" any two genotypes in the set T~ 1 (g") C Q' are indistinguishable in terms 
of their associated selective cost, and so we may identify any two such genotypes 
as being the same. Of course, we cannot recover the finer description p' t from the 
coarser one p'l in general, but if we let P[ and P' t ' be the population distributions 
of genotypes corresponding to p' t and p'[ (that is, P[ and P" are the distributions 
of Poisson random measures on M 1 and M" with intensity measures p' t and p"), 
then the push-forward of P[ by S' is the same as the push- forward of P" by S": 
the population distributions of selective costs agree whether we use the fine or the 
coarse description of genotypes. 

1.2. Example I: Mutation counting 

The simplest special case of our framework occurs when there are many loci 
but the selective cost of a genotype g only depends on the total number of loci 
g(M) at which there have been ancestral mutations away from the wild type. 

For example, suppose that the space M of loci is the unit interval [0, 1] and the 
selective cost S is of the form S(g) = s(g(A4)) for some non-decreasing function 
s : N -> M+ with s(0) = 0. 

We may apply Remark |l.l| and "replace" the locus space M. by a single point. 
Let q := v(M) be the total rate at which mutations occur and write r t := pt(-M) 
for the expected number of ancestral mutations in the genotype of an individual 
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chosen at random from the population at time t > 0. It follows from Remark |1.1| 
that the function r evolves autonomously according to the (non-linear) ordinary 
differential equation 

r = q- ip(r), 

where 

OO 

i>(x) :=x^( S (fc + l)- S (fc))e- a; | r 

k=0 

OO 

= e-^ S (fc)| r (fc- a; ). 
fe=i 

One example of this simplified model, which lines up with models familiar 
from earlier literature, assumes the cost per mutation to be a constant s, so that 
s(k) — sk and ip(x) — sx, and the starting point to be the null genotype. In this 
case, we readily compute that the intensity at time t is r t — (q/s)(l — exp(-st)). 
The intensity converges monotonically to the equilibrium value q/s, the elementary 
expression for mutation-selection equilibrium going back to Haldane. Thus, the 
limiting distribution for the number of mutations from the ancestral wild type is 
Poisson with mean q/s. Our assumption about recombination leads to simpler 
answers than Kimura and Maruyama's treatment of mutation counting without 
recombination (sec KM66J). 

When costs are multiplicative, in the sense that s(k) := 1 — exp(— ka) for 
some constant a > 0, we have ^p{x) — axexp(-ax), where a := 1 — cxp(— a). An 
equilibrium exists only if the mutation rate q is below the maximum of r exp(— r), 
namely e _1 , and in that case rt converges monotonically to the smallest positive 
solution of the equation q — ip(x) = 0. The solution is x — W(— q)/a, where 
W(z) :— J2^=i(~ n ) n ~ 1S n\ i s Lambert's W function - the power series expansion is 
a simple application of the technique known variously as reversion of series or the 
Lagrange inversion formula. These properties for multiplicative costs are general- 
ized in Section [3. 2 1 Note also that when a is small the equilibrium is approximately 
q/c, as one would expect from the observation that in this case s(k) is approxi- 
mately ak for small k and hence this model is approximately the additive one of 
the previous paragraph with s = a. 

If we are interested only in how the population distribution of selective costs 
evolves, then we need consider only (r t )t>o, rather than {pt)t>o- However, we 
should be somewhat careful about how we interpret the biological import of this 



simplification. We justified the dynamical system (1.5 1 as describing the evolving 
population distribution of genotypes defined in terms of the locus space M in a 
population undergoing mutation, selection and recombination. Mathematically, we 
see that in this example we can replace the locus space [0, 1] by a single point for 
the purposes of studying the dynamics of the distribution of selective costs, but this 
does not mean that biologically the multilocus model is identical with a single locus 



model. As we show later, our instance of ( 1.5 1 with locus space [0, 1] arises as a limit 



of discrete generation models in which Poisson random measures appear because 
of the manner in which recombination breaks up and shuffles together genotypes 



from different individuals. Even though our instance of (1.51 with locus space a 



single point is mathematically well-defined, it can't arise as a limit of such discrete 
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generation models because with a single locus there is no way that recombination 
can drive genotype distributions towards Poisson. 



1.3. Example II: Polynomial selective costs 

Recall that when M. = {1,2, ... ,N} we can encode genotypes as ordered 
N— tuples of nonnegative integers, where the entry in the k th coordinate is the 



number of ancestral mutations present at locus k. Then, (1.5 1 becomes the system 
of ordinary differential equations 

(1.6) j t Pt{{j}) = " Pt({j}) E + e j) - S(n)] n e-"" ^^"* . 

n k=l 

As in the previous example, we should not think of {1,2, ... ,N} as being 
the "real" locus space. Rather, we should imagine that there is something like 
a continuum of loci which may be partitioned into N sub-classes in such a way 
that the loci in each sub-class have indistinguishable selective effects, and (|1.6| is 
a reduced description that comes from applying the observation in Remark |1.1| 

A natural family of selective costs is given by those of the polynomial form 



S(g) = ^aig I , 



where the sum is taken over all nonempty subsets / C {1, . . . ,N} and we adopt 
the usual multi-index convention that for a vector v the notation v 1 denotes the 
product riig/^i- The constant a^} for 1 < i < N measures the selective cost of 
mutation i alone, whereas the constant ai for a subset / C {1, . . . , N} of cardinality 
greater than one measures the selective cost attributable to interactions between 
all of the mutations in / over and above that attributable to interactions between 
mutations in proper subsets of /. 



The system of ordinary differential equations (1.6 1 becomes 



(1.7) p k = Vk-J2 ai P*> l < k < N > 

where we write p^ :— p({k}) and := v{{k\), and where denotes the collection 
of subsets of {1, . . . , N} that contain k, see ICE09| . 

It is shown in |CE09] that if > for all k (that is, mutations may occur at all 
loci), > for all i (that is, the individual effect of any mutation is deleterious, 
in keeping with our general assumption on the selective cost), and a/ > for 
all subsets / (that is, the synergistic effects of individually deleterious mutations 



are never beneficial), then the system of equations (1.7 1 has a unique equilibrium 



point in the positive orthant . Moreover, this equilibrium is globally attractive; 
that is, the system converges to the equilibrium from any initial conditions in 
EjI. The condition a/ > for all / certainly implies our standing assumption that 
S(g+h) > S(g) for all g,h £ Q, but it is strictly stronger. It is also shown in [CE09 
that the analogue of this result for general M. holds with a suitable definition of 
polynomial selective costs in terms of sums of integrals against products of the 
measure g with itself. 
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1.4. Example III: Demographic selective costs 

The following model is discussed in |WES08 . WSE08], where there is a more 
extensive discussion of the demographic assumptions. 

For the moment, suppose that the space of loci M. is general. Write £ x (g) for 
the probability that an individual with genotype g G Q lives beyond age x G . At 
age x, the corresponding cumulative hazard and hazard function are thus — log 4 (g) 
and d/dx(— log £ x (g)) 7 respectively. Suppose that the infinitesimal rate that an 
individual at age x G R+ has offspring is f x , independently of the individual's 
genotype. For individuals with genotype g, the size of the next generation relative 
to the current one is J Q f x £ x (g) dx. 

Suppose further that there is a background hazard A and that an ancestral 
mutation at locus m G M contributes an increment 6(m 1 x) to the hazard function 
at age x. Thus, the probability that an individual with genotype g G Q lives beyond 
age x G K+ is 



£ x (g) — exp I —Ax — / 6{m,x) dg(m) 

\ JM 

The corresponding selective cost is 

/•oo /*oo 

S(g) = f x 4(0) dx - / f x 4 (g) dx 
Jo Jo 

(recall that selective costs represent relative rates of increase, and we have adopted 
the normalizing convention that 5(0) = 0). 
Note that 

S(g + 5 m ) - S(g) = J™ (l - e - e( - m ^) f x £ x (g) dx 



and 



E [4(1")] = exp (-Xx -J (l- e - e{rn ' x) ^ %{dm) S j 



for any finite measure n on M. (recall that X™ is a Poisson random measure on M. 
with intensity measure 7r). 

An important issue for demographic applications is whether the solution p tl 



t > 0, of (1.5 1 converges to an equilibrium as t — » oo and, if so, what are the 
features of that equilibrium - in particular, what we can say about E,[£ X (X P °°)], 
the probability that a randomly chosen individual from the equilibrium population 
lives beyond age x. It is not hard to show that if the limit exists, then it must 
be absolutely continuous with respect to the mutation rate measure v and have a 
Radon- Nikodym derivative r m that satisfies 

/>oo 

l = roo(m) / (l-e- e ^)f x E[e x (X^)]dx. 



We study such equilibria further in Section [378] in and in [WES081 [WSE08J. 
In particular, we show in [WES08 that if M. — E+, the mutation rate measure 
v is absolutely continuous with respect to Lebesgue measure, and 9 is of the form 
6{m,x) = rj{m)\ x > m , so that we consider mutations which each provide a point 
mass increment to the hazard at a specific age, then the equilibrium equation turns 
out to be equivalent to a second-order, non-linear, ordinary differential equation in 
one variable that can be solved explicitly. 



1.5. COMMENTS ON THE LITERATURE 
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1.5. Comments on the literature 

We make some brief remarks about the substantial literature on multilocus 
deterministic models in population genetics and its relation to our work; 



A very comprehensive recent reference is Reinhard Burger's book [BOO] (see 
also Burger's review paper [Biir98]). As well as giving an overview of the classi- 
cal models for finitely many alleles at each of a finite number of loci, these works 
consider at length deterministic haploid continuum- of- alleles models in which in- 
dividuals have a type that is thought of as the contribution of a gene to a given 
quantitative trait. The type belongs to a general state space that represents some- 
thing like the trait value (in which case the state space is a subset of R) and is 
often thought of as the combined effect of a multilocus genotype. Each type has 
an associated fitness, which is some fairly arbitrary function from the type space 
to (0, 1]. However, the models do not explicitly incorporate a family of loci, the 
configuration of alleles present at those loci, or a function describing the fitness of 
a configuration. Rather, everything is cast in terms of how fit each type is and how 
likely one type is to mutate into another. 

Certain classes of mutation-selection models without recombination are solved 
explicitly in [WBG98, BW01J using ideas from statistical mechanics. Such models 
may be thought of as either multilocus systems with complete linkage or structured 
single locus systems. We also mention the constellation of papers [BB031 IBaa051 
Baa07, BaaOl] presenting a deterministic model of population change due to 
recombination alone. 

Finally, we make two comments about our model in order to distinguish it from 
others in the literature that superficially might seem to have similar features. 

• We do not present a Markovian stochastic model such as the one in 

SH92J, where the population is described in terms of an evolving Pois- 
son random measure that keeps track of the proportion of alleles at each 
site that are mutant (with a typical site being purely wild type and only 
exceptional sites having a positive proportion of mutants present). For 
us, the evolution of the proportions of different genotypes in the popula- 
tion is described by a deterministic dynamical system living on a space of 
probability measures. If we sample from the evolving probability measure 
at some fixed time, then the resulting individual's genotype is a Poisson 
random measure. 

• Linkage arises in our discrete-time approximation models as the depen- 
dence between loci, a natural consequence of non-additive selection costs, 
is only partially broken up in any finite number of rounds of recombina- 
tion. However, linkage does not appear in the limit model. That is, if 
we sample from the limit population at some time, then the fact that the 
resulting individual's genotype is described by a Poisson random mea- 
sure means that the presence of ancestral mutations from wild type in 
one part of the genotype is independent of the presence of mutations in 
another part. Our limit theorem may therefore be thought of as delineat- 
ing the relative strengths of mutation, selection and recombination that 
lead asymptotically to a situation in which the Poissonizing effect of re- 
combination wins out over the interactions introduced by non-additive 
selection. Understanding how these two forces counteract each other is 
far from trivial and is the most demanding technical task of the present 
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work. We cannot stress too strongly that we have not a priori assumed 
that linkage is not present. 

1.6. Overview of the remainder of the work 



We define the measure- valued dynamical system (1.5 1 rigorously in Chapter [2] 



and establish the existence and uniqueness of solutions. 

We investigate in Chapter [3] whether the dynamical system has equilibria and 
whether these equilibria are stable and attractive. 

We devote Chapters [4] to [8] to showing that the dynamical system is a limit of 
discrete-generation, infinite-population models. We define the discrete-generation 
models in Chapter [3] and we state the convergence theorem and its hypotheses in 
Chapter [5] As a first step towards the proof of the convergence result, we show 
in Chapter [6] that an analogous convergence result holds when the recombination 
mechanism is replaced by a complete Poissonization operation that destroys all 
linkage between loci in a single step. The proof of the actual convergence result is 
quite involved and requires a number of technical estimates that, loosely speaking, 
bound the extent to which selection reintroduces linkage that has been partially 
removed by recombination. We present these preliminary results in Chapter [7] We 
complete the proof of the convergence theorem in Chapter [8] 

The passage from the discrete-generation mutation-selection-recombination model 
to its continuous-time limit involves the "linearization" of operators that describe 
mutation and selection. This linearization is essentially a consequence of the fact 
that logE [e~ T ] is approximately — E[T] when the nonnegative random variable T 
is close to the being constant. A simple quantitative version of this observation is 
used several times in this work. We state the result and prove it in Appendix |A") 



CHAPTER 2 



Definition, existence, and uniqueness of the 
dynamical system 

2.1. Spaces of measures 

Recall from the Introduction that among the fundamental ingredients in our 
model are: 

• the complete, separable metric space M. of loci; 

• a finite Borel measure i/onAf called the "mutation measure" , because it 
describes the rate at which mutations occur in regions of the genome; 

• the space Q of integer-valued finite Borel measures on M. - an element of 
Q represents a genotype thought of as the set of loci at which there have 
been ancestral mutations away from the reference wild type, so that, in 
particular, the null measure represents the wild genotype. 

Recall also that the state of the population at time t > in our model is a 
probability measure Pt on Q that is the distribution of a Poisson random finite 
measure on M. . The distribution of such a Poisson random measure is determined 
by its intensity measure, which is a finite Borel measure on M.. The following 
notation will be useful: 

Notation 2.1. Denote by 7i the space of finite signed Borel measures on M.. 
Let 7i + be the subset of TL consisting of nonnegative measures. 

For 7r £ TL, write 7r + , tt~ € 7i + for the positive and negative parts of 7r appearing 
in the Hahn- Jordan decomposition. Thus, ir = ir + — ir~ . 

We next recall the definition of the Wasserstein metric (also transliterated as 
Vasershtein, or otherwise). This metric provides a unified way of topologizing the 
various spaces of measures that we use. 

Notation 2.2. Given a metric space (E,d), let Lip be the space of functions 
/ : E -> K such that 

(2.1) ||/|| Lip := sup 1/(1)1 + sup < oo. 

Define a norm || • ||was on the space of finite signed Borel measures on (E, d) by 

(2-2) |M|wa S : =sup{|7r[/]| : \\f\\ Up < 1}, 

where 

(2.3) tt[/] := J fdw. 

An extensive account may be found in |Rac91l IRR98] , while the properties 
used here are described in Problem 3.11.2 of [EK86J. 
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We may build Wasserstein metrics on top of Wasserstein metrics. We start by 
taking M. for the metric space X. Then (/, as a space of measures on Ai, has a 
Wasserstein metric. Next we take Q with its Wasserstein metric for the metric space 
X, and obtain a Wasserstein metric on the finite signed measures on Q, including 
the measures Pt- Also, with M. again playing the role of X, we obtain a Wasserstein 
metric on TL. 

We note that the designation "Wasserstein metric" is often applied to the anal- 
ogous definition where the constraining Lipschitz norm || • 1 1 Lip does not include 
the supremum norm term. This latter distance is always greater than or equal to 
the Wasserstein metric as we are defining it. It is equivalent to the Kantorovich- 
Rubinstcin distance, which is a member of the class of Monge-Kantorovich dis- 
tances, a class defined by a single parameter p; the Kantorovich-Rubinstein dis- 
tance is the element of this class corresponding to p — 1 . Details may be found in 
|Vil03l Section 7.1], |Vil09l Chapter 6] or |AGS05l Section 7.1]. 



2.2. Definition of the dynamical system 

Recall that the state of the population at time t > is given by a probability 
measure P t on Q that is the distribution of a Poisson random measure with intensity 
measure pt- Our informal description of the evolution of pt, and hence of Pt, is (see 
equation ( 1.5 l) 

(2.4) pt(dm) = po(dm) + t v{dm) - [ E [S(X P ° + 5 m ) - S(X Ps )] p s (dm) ds, 



where we write I 1 for a Poisson random measure on M. with intensity measure 
tt S H+. 

In order to formalize this definition, it will be convenient to introduce the 
following two objects. 

Definition 2.3. 



(2.5) 
(2.6) 
(2.7) 



• Define F : M x H+ -> R+ by 

F n (x) := E[S(X 7r + S x ) - S(X*)] for x € M and tt € 

• Define the operator D : H + — > H + by 

d(DTT) 



dn 



-(x) :=F„{x). 



That is, for any bounded / : M. 

f(x) {D-K){dx) = 



M 



f{x)F„(x) n(dx). 



M 



Formally, a solution to ( 1.5 1 is an 7i + -valued function p that is continuous (with 
respect to the topology of weak converge of measures) and satisfies 



(2.8) 



for all t > 0. 



Pt — Pa + tv - I Dp s ds 



Equation (2.8 1 involves the integration of a measure- valued function, and such 
an integral can have a number of different meanings. We require only the following 
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notion: If r\ : R + — > TL is a a Borel function, then for t > the integral X t — f Q rj s ds 
is the element of TL satisfying 

(2.9) I t {A) = [ rj s (A) ds for every Borel A C M. 

Jo 

This integral certainly exists (and is unique) if the function rj is continuous with 
respect to the topology of weak convergence of measures. For more information 
about integration on infinite dimensional spaces, see chapter 2 of |DU77j . 

2.3. Existence and uniqueness of solutions 

Theorem 2.4. Fix a mutation measure v G TL + and a selective cost S : Q — > 
M+. that satisfies the conditions 

• 5(0) = 0, 

• S(g) < S{g + h) for all g, heQ, 

• for some constant K, \S(g) — S(h)\ < K\\g — ^ w > f or a ^ 9i h ^ § ■ 
Then, equation (2.8 I has a unique solution for any po € 7i + . 

Proof. Fix a time horizon T > and let c > be a constant that will 
be chosen later. Write C([0,T],H) for the Banach space of continuous 7i-valued 
functions on [0, T], equipped with the norm 

||a|| c = sup e~ ct ||a t ||was- 

0<t<T 

Denote by V the closed subset of C([0,T),TL) consisting of 7i-valued functions 
a with «o = Po and 

a+(M) < p {M)+tv{M) 

for < t < T (recall from Notation 2.1 that the measure aj is the positive part in 
the Hahn- Jordan decomposition of the signed measure a t ). 
Define a map A : TL — > TL by 

(Ao); = po — / Da^ds + tv. 
Jo 

Note that A maps V into itself. Moreover, for a, {3 € T, 
||Aa-A/3|| c < sup e- ct / ||Aa+-A/3+ 

0<t<T Jo 



,,, ds 

Was 



< Q Sup^e- c * / K(2 + 6{a+(M)Af3+(M)})\\a s -(3 s \\ Was ds 



by Lemma [2.9| below 



< sup e~ ct f K(2 + 6{p (M) + sv(M)})e cs \\a - [3\\ ds 

0<t<T Jo 



< sup e c * 

0<t<T 



e ct — 1 Crf — 1V C * 4- 1 " 
K(2 + 6 Po (M))- +6Kv(M) y — > -^—— \\a-p\\. 



Thus, A : r — * r is a contraction, provided c is chosen sufficiently large. 
It follows from the Contraction Mapping Theorem that the equation 

(2.10) p t = Ap t = p - [ Dp+ds + tv 
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has a unique solution in T. Furthermore, any function in 7i that is a solution to 



(2.101 must automatically be in T. Therefore, the solution is unique. 

This unique solution is a function taking values in the set of signed measures TL, 
and it remains to show that it actually takes values in the subset TL + of nonnegative 
measures. For any Borel set A C J\A, 

p t (A) = Po (A)- J J F p +(x) dpf(x) ds + tv(A). 

In particular, t \— > pt{A) is continuous. For any Borel set icMwe have 

pt{A)< Po (A)+tu(A). 

Since F 7T (x) < K for all x, 

Pt{A) > po(A) -K f (p (A) + au(A)) ds + tv{A) 
Jo 

= (l-Kt)p (A)+t(l-~) u(A). 

Hence, pt(A) > for < t < 1/K. Because this is true for all A, we have p t G Ti. + 
for < t < 1/K. Iterating this argument, with the time replaced successively by 
the times 1/K, 2/K, . . . gives the result. □ 

2.4. Lemmas used in the proof of existence and uniqueness 

We assume in this section that the hypotheses of Theorem |2 . 4| hold . 
Lemma 2.5. The function F n is in Lip for each finite measure tt € TL + , and 

sup || J F 1 7r (-)||Li P <2X. 

Proof. By definition, 

Ili^Glkip - sup |E[spr + s x ) - s(x«)]\ 

X 

+ sup \E[S(X* + S x ) - S(X*)] - E[S(X* + S y ) - S(X*)]\ /d(x, y) 

< sup K 1 1 S x 1 1 Was + supK\\S x - 6 v \\ Was /d(x,y) 

< 2K. 

a 

Notation 2.6. Given a measure q 6 H + . define II g to be the distribution 
of the Poisson random measure with intensity q. Thus, if P is the distribution 
of a Poisson random finite measure on M. (that is, a Poisson distributed random 
element of G), then P = U^p, where p,P € H + is the intensity measure associated 
with P (that is, pP[F] = J g g[F] dP(g)). 

The next result probably already exists in some form in the literature, but we 
have been unable to find a reference. It says that if the intensities of two Poisson 
random measures are close in the Wasserstein sense, then the same is true of their 
distributions. In the statement of the result, the Wasserstein distance on the left- 
hand side of the inequality is between probability measures on the space Q, while 
the Wasserstein distance on the right-hand side is between finite measures on M.. 



2.4. LEMMAS USED IN THE PROOF OF EXISTENCE AND UNIQUENESS 

Lemma 2.7. For two finite measures tt', tt" € H + 
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7T ii 7r" 



I Was 



I Was' 



PROOF. Fix / : Q -» Lip with ||/|| Lip < 1. Fix 7r',7r" e W+. If tt' = tt" = 
there is nothing to prove. Therefore, suppose without loss of generality that tt' ^ 
and tt'(M) > tt"(M). Set tt* = {tt"{m)/tt i {M))tt i € W+. We have 



liv [/] - lv [/] I < In,, [/] - n w . [/] I + In,, [/] - [/] I . 



Note that if X 71 ' and X 7r are any two Poisson random measures on the same 
probability space with distributions n,/ and H n » , respectively, then 



|n^[/] -n w .[/]| = |e [/(i") - /(x-*) 

< 2P{X 7r ' ^ X"*} 



because |/(g)| < 1 for all g e Q. In particular, if we first build X* and then con- 
struct 1" by the usual "thinning" procedure of independently keeping each point 
of X v with probability tt" (M) / tt' (M) and discarding it with the complementary 
probability, we have 



|IM/] < 2F{X*'(M) + X*'(M)} 

) TT'(M) k 



< 



(M)_ 



k=0 



k=0 



k\ 

tt'{MY 
k\ 

7T"{M) 



( tt"(M) \ 
tt'(M) J 



1 tt"(M) 
tt'{M) 



= 2tt'(M) 1- 

tt'(M)_ 
= 2\tt'{M)-tt"{M)\ 
<2||7r'(M)-7r"(M)|| Was . 
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Sotting r = n*(M) = n"(M), 

liM/i-n^/]! 



< 



E 

k=0 



k\ 



oo _ r k — 1 

— E ~fcT E 

/c=0 ' m=0 



f\J2 S y'J (dyi) ■ ■ ■ n* (dy k ) 

/ ^E^«^ n* (dyi) ■ ■ ■ ir* (dy m )Tr" (dy m+1 ) ■ ■ ■ it" (dy k ) 
J "J $ ( E ) ^(^fi) ' ' ' 7r *(%n+i) 7r "( d J/'n+2) ' ' ' ^"{dy k ) 



\£=1 



< ^rr-r fc *A; sup 

< || vr* - 7r"|| Was 

< ||7r'-7r"||wa S + |7r'(X)-7r"(M)| 

< 2||tt* -7r"||was- 

Putting these bounds together yields 



f(g + S z )n*(dz)~ / f(g + 6 z )*"(dz) 



|lM/]-IV,[/]] <4||7r'-7r"| 



Was, 



as required. 



□ 



Lemma 2.8. For two finite measures n', tt" € TL + , 

sup \F n ,{x) - F„i> (x) | < 8X||tt' - ^"llwas- 

Proof. Fix x e M. Define G : Q -> R + by G(g) := + 4) - S{g))/2K. 
Then, ||G||Li P < 1 by the Lipschitz assumption on S. Note that F w (x) — 2KH W [G]. 
By definition of the Wasserstein distance, 

\F n , (x) ~ F,« (x) | = 2X|LV [G] - LV [G] | < 2K\\U n> [G] - IV [G] || Was . 

The lemma now follows from Lemma 12.71 □ 



Lemma 2.9. For two finite signed measures o~,t £7i, 

\\Da+ - £T+||wa S < K{2 + 8{a+(M) A t+(M)}) \\a - r\\ Was . 
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Proof. Suppose without loss of generality that a + (M) < t + (M). By Lem- 
mas [2751 and Ol 



f(x)Da + (dx)~ I f{x)Dr + {dx) 

F(x 7 a + )f(x)a + (dx) - I F(x,T + )f(x)cr + (dx) 



< 



F(x,T + )f(x)a+(dx) - / F(x,T+)f(x)T+(dx) 



< 8K\\<j+ - T+llwJI/IUa+CM) + 2K ||/||Li P ||<7+ - t 



Was 



< (2X + 8Ka + {M))\\f\\ Lip \\<J - T||was, 
where we have used the fact that ||/7"||u P < ||/'||up||/"||u P for /', /" e Lip. □ 



CHAPTER 3 



Equilibria 



One of the primary problems concerning our dynamical systems is to under- 
stand their asymptotic behavior. We begin the analysis of these asymptotics by 
identifying when fixed points of the motion exist, and then examining whether there 
is convergence to these fixed points from suitable initial conditions. 

We assume throughout this chapter that the assumptions of Theo- 
rem 



2.4 always hold. 



Definition 3.1. A finite fixed point or equilibrium for the motion (the terms 
will be used interchangeably) is a measure p* G TL + at which the driving vector 
field vanishes. That is, is absolutely continuous with respect to v, with Radon- 
Nikodym derivative satisfying 

The equilibrium p* is called stable if for every neighborhood V of there 
is a neighborhood U C V, such that p t E V for all times t if po G U. It is 
called attractive if it is stable and there is a neighborhood Uq of such that 
limt-Kx, Pt = P* whenever p <EU a . 

We introduce the terms box-stable and box-attractive when the above definitions 
hold if "neighborhoods" in the above definitions are replaced by sets of the form 

(3.2) B(p,p') :={p : p<p<p'}, 

where p < p« < p 1 , and both measures p — p* and p' — p* arc mutually absolutely 
continuous with respect to v. 

Remark 3.2. Note that box-stability (respectively, box-attractivity) is a weaker 
condition than stability (respectively, attractivity) , because boxes do not contain 
open neighborhoods in the topology induced by the Wasserstein metric (that is, 
the topology of weak convergence), but open neighborhoods do contain boxes. 

Of course, it is not obvious that the motion needs to have fixed points, since v 
could dominate all fitness costs. This possibility is easy to see in the one-dimensional 



case (when M is a single point), which we described in Section 1.2 and which we 



revisit in Section 3.1 However, we show in Section 3.4 that at least one fixed 



point exists when the mutation measure v is small enough. In order to go further, 
we need to impose additional assumptions. For the case of multiplicative selection 
costs, Section [372] gives a complete description of the fixed points, of which there can 
be 0, 1 or 2. In the remainder of this chapter we then impose a weaker assumption, 
namely that the selective cost is concave, in the sense that the marginal cost of 
an additional mutation decreases as more mutations are added to the genotype. 



(The formal definition is given as the last condition in Theorem 3.11 ) Under 



this assumption, we show in Section |3.5| that trajectories starting from increase 
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monotonically, and we give a sufficient condition for them to converge to a finite 
equilibrium. This equilibrium is dominated by any other equilibrium. If the driving 
vector field at nearby points above this minimal equilibrium are in the negative 
"orthant" , then trajectories starting above the equilibrium will converge down to 
it, so the minimal equilibrium will be box attractive. We give an iterative procedure 
for computing the minimal equilibrium that avoids following the dynamical system 
to large times in Section [3~6) Moreover, the iteration is a useful theoretical tool in 
Section|3.7| where we establish that the equilibria shown to exist for suitably small 



v in Section 3.4 are in fact the same as the minimal equilibria in the concave setting 



and that these equilibria arc then box stable. Finally, in Section 3.8 we apply the 



above results to the demographic selection cost introduced in Section |1.4| 

These results cover many cases of substantive interest, although they do not 
settle all relevant questions. Cases with very small v(M) do not wholly put on dis- 
play the rich array of differences between the full non-linear model and non-epistatic 
additive models. However, the conditions of Corollary |3.14 and Theorem 3.15| can 
often be verified in specific cases, and we have found that they can hold when v(M) 
is only moderately small. 

The example discussed in [WSE08 provides numerical evidence that there can 
be convergence to an equilibrium even without uniformly monotone growth. 



3.1. Introductory example: One-dimensional systems 

Suppose as in Section [P] that the space of loci M. consists of a single point. 
Then, the space of genotypes Q may be identified with the natural numbers No, the 
selective cost S is simply an increasing function from No to K+, and the mutation 
measure v is a positive constant. The space H+ of finite measures measures on Q 
may also be identified with K + . The dynamical system (pt)t>o is K+-valued and 
satisfies 

(3-3) p t = v - F Pt p t , 

where 

(3.4) F p p = e-^^ A m- 

fc=i 

The system has equilibria at solutions to the equation pF p = v and we discussed 
some special cases in Section |1.2| In general, it is possible to construct selective 
costs for which the number of equilibria is arbitrarily large for a given mutation 
rate. For example, suppose the selective cost has magnitude 1 for 1 to 5 mutations 
and magnitude 2 for 6 or more. The resulting function —pF p is shown in Figure^ 
The number of equilibria may be 0, 1, 2, 3, or 4, depending on the value of v. 

However, there are a few things that we can say quite generally about one- 
dimensional systems: 

• As long as S is not identically 0, the same will be true for F p , so there 
will be at least one equilibrium for v sufficiently small. 

• The smallest equilibrium will be attractive, unless it corresponds to a 
local minimum of —F p p, in which case it will attract only trajectories 
coming from below; trajectories starting above the equilibrium will then 
be repelled. 
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Figure 1. Plot of the function ~pF p when S(k) = l{fe>i} + 1{A;>7}. 



• If S* is bounded above, then —F p p is bounded below, and so there is no 
equilibrium for v sufficiently large. 

3.2. Introductory example: Multiplicative selective cost 

It is probably apparent to the reader by now that one cannot hope to get 



anything like a closed-form solution for the dynamical system (2.8 1 for general 



selective costs. However, we first show in this section that it is possible to solve 
(|2.8[) explicitly when the selective cost has the form 



(3.5) 



s (g) = 1 - exp 



a(m) dg(m) 



M 



for some a : M. — ► R+. Moreover, it is possible to analyze in this case whether 
limt^oo pt exists, and we will couple this analysis with a comparison argument 
in later sections to give sufficient conditions for the existence of a limit for more 
general selective costs. 

Suppose that po is absolutely continuous with respect to v. It follows from 
(2.8 1 that pt will have a density <j) t against v, for all £ > 0. These densities will 



solve the equation 



d<j)t(m) 
dt 



1 - E[S(X* + S m ) - S(X")] Mm). 



(3.6) 



Put &fc(£) := J exp(— k<j(m))4>t(m) dv(m) for k £ N, then 
d4> t (m) 



dt 



= !-(!- exp(-o-(m)))exp(&i(£) - b {t)) <j> t {m) . 
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This is an ordinary differential equation for each m £ M. that has the solution 



4>t{m) = exp ( - / (1 - exp(-cr(m))) exp(6i(s) - b (s)) ds 

(I — exp(— <r(m))) exp(6i(r) — bo(r)) dr I ds 









X 


O (m) + 


/ exp H 









Thus, we have reduced the (in general) infinite collection of coupled ordinary dif- 
ferential equations to the problem of finding two functions, bo and b%. 

Of course, this will benefit us only if we have an autonomous system of equations 
in just the functions bo and b\. To this end, set a k := J exp(— fccr(rn)) dv(ra) for 
k £ N. Then, from ( |3.6[ ) we have 

(3.7) dbl ^f L =a k + exp(h(t) ~ b (t))(b k+1 (t) - b k (t)). 

Introduce the generating functions 

A(z 

and 



k=0 



B(z,t) :=J2b k (t) 



k=0 



kl 



The system of ordinary differential equations (3.7 1 then becomes the first order, 
linear partial differential equation 



dB(z,t) 



= A(z)+exp{h(t)-b (t)) 



dB{z,t) 
dz 



B(z,t) 



We may find the general solution of this PDE via the method of characteristic 
curves. Once we get the general solution (which will involve the unknown functions 
bo and &i), we have to impose the conditions 

B{0,t)=b (t) 

and 

dB(0,t) 



dz 



to get the solution we are after. 

Using Mathematica, the PDE has the solution 

B(z,t) — J exp ^— J exp(6i(s) — b (s)) ds^j A (^z + J exp(6i(s) — b (s)) ds^j du 



exp ^— J exp(6i(s) — b (s)) ds^j C (^z + J 



exp(6i(s) - b (s)) ds 



where C(z):=Er=o^(0) |r- 

Therefore, the functions bo and b\ solve the system of equations 

b (t) = J exp \ — J exp(6i(s) — b (s)) ds^j A (J exp(b 1 (s) — b (s)) ds^j du 



exp 



exp(&i(s) — b (s)) ds I C ( / exp(&i(s) — b (s)) ds 
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and 



exp 



exp(6i(s) — b (s)) ds I A' I / exp(fei (s) — 6 (s)) cis I 



( \p I - I exp(6i(s) — &o(s)) ds J C yj exp(&i(s) — &o( s )) 

Note that this system of ordinary differential equations it autonomous: it depends 
only on the unknown functions 60 & n d b\ themselves, together with the known 
functions A (determined by the given mutation and selection) and C (determined by 
the initial conditions) . Hence, these equations could at least be solved numerically. 
(Of course, we started with a system of as many equations as there are points in 
M.. This reduction to a system of two coupled ordinary differential equations is 
only advantageous if the space M. has more than two points.) 

We can now obtain a necessary and sufficient condition for the existence of an 



equilibrium for (2.8 1 when the selection cost is the multiplicative one of (3.5 1. Note 



that if is an equilibrium, then we have 

(3.8) v{dm) - E[S(X P * + 6 m ) - S(X P *)] p*(dm) = 0, 

and so has a density against v that satisfies 



<t>*{rn) 



1 — exp(— 17(771)) ' 



where b* k := Jexp(—ka(m))(f>^(m)dv(m) for k E {0,1} (cf. (3.6 1 ) . Therefore, an 
equilibrium exists if and only if 



1 — exp(— er(m)) 
and there is a constant c > such that 
exp(c) 



dv{m) < 00 



dv(m) — \ exp(— a{m)) 



exp(c) 



1 — exp(— (7(777)) 



dv(m) 



1 — exp(— (7(777)) 
= exp(c) v(M), 

in which case c = &q — b\ . Such a constant exists if and only if 

v(M) < supxexp(— x) = e _1 . 

x>0 

Note that there are three possible cases: 

• If v{M) < e _1 , then there are two equilibria, corresponding to the two 
distinct solutions of ce~ c = v(M). The equilibrium corresponding to the 
smaller c is attractive, while that corresponding to the larger c is unstable. 

• If is(A4) > e" 1 , then there is no equilibrium. 

• If v(A4) — e _1 , then there is a single equilibrium, which is unstable. 



3.3. Prechet derivatives 

We need to introduce some machinery on derivatives of curves and vector fields 
in order to analyze equilibria in more generality Consider a Banach space (X, || • || x) 
and a closed convex cone X + C X. For x € X + , let U x be the closed convex cone 
{t(x' — x) : t > 0, x' G X + }. Consider another Banach spaces. (Y, \\ ■ \\ Y )- Extending 
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the usual definition slightly, we say that a map $ : X + — > Y , is Frechet difTerentiable 
at x G X + , if there is map D x <& : U x — > Y with the properties 

lim \\x' -x\\x 1 ($(x')-$(x)-D x $[x' -x\) = 0, 

D x $[tz] = tD x $[z] for t > and z G U x , D x <S>[z' + z"} = D x <5>[z'] + D x $[z"] for 
z',z" € C/a;, and, for some constant C, ||Ac[2]||y < C||z||x for z € {7 X . It is not 
difficult to show that if <& is difTerentiable at x, then D x $ is uniquely defined. 

As usual, we say that a curve ip : I — ► X, where / C K is an interval, is 
difTerentiable at t G I if the limit 

i, t = \im(t' -t)- 1 ^' -ipt) 

exists. 

For the sake of completeness, we record the following standard fundamental 
theorem of calculus and chain rule. 

Lemma 3.3. Consider an interval I C K and a Banach spaces (X,\\ ■ \\x)- 
Suppose that a curve ip : I — > X is difTerentiable at every t £ I and the curve 
1 1 — ► iht is continuous. Then, 



tpb -4>a= I tpt dt 



for all a,b G I with a < b. 



Lemma 3.4. Consider an interval / CI, two Banach spaces (X, || • \\x) and 
(Y, || • || y), and a closed convex cone X + C X . Suppose for some t G / that the curve 
tp : I — > A% is differentiable at t G / and £/ie map $ : -X+ — > Y is differentiate at 
ipt. Then, the curve <j> o ^ : / — > Y" is differentiable at t with derivative D^ t ^[ip t ]. 

We also have a particular analogue of the product rule. Write Ct(jM,M) for 
the Banach space of bounded continuous functions from M. to K equipped with the 
supremum norm. 

Lemma 3.5. Consider an interval I C 1 and iwo curves 7 : J — * 7i+ and 
/ : J — > C(,(A^,K). Suppose that 7 and / are differentiable at t £ I. Define a 
curve /3 : 1 —> H+ by j3 u := /„ ■ 7„, it G J; £/ia£ is, /3 U is the element ofTi that has 
Radon-Nikodym derivative f u with respect to j u . Then, [3 is differentiable at t with 

$t = ft-it + ft- it- 



Proof. The follows from Lemma |3.4| with X = C b (M,R) x H+, Y = H, 
$(e, rj) — e ■ n, and tp — (/, 7) upon showing that the map $ is difTerentiable at any 
(e,n) G C b (M,R) x H with 

D e ^[{e',r]')] = e' -rj+e-r)' 

and the curve ifi is difTerentiable at t with ip t — (ft,jt)- Both proofs are straight- 
forward and we leave them to the reader. □ 

For 77 G H+ and to', to" G M., set 

K v (m', to") := e[s(X" + 5 m , + S m „) - S{X^ + 6 m „) - S{X^ + S m ,) + 5(X") . 

By our standing assumption the conditions of Theorem |2.4| are in place, the map 
(n, to', to") 1— > K v (m' , m") is bounded. Ideas similar to those behind Lemma 



2.7 
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and Lemma |2.8| establish that this kernel gives the Frechet derivative of the map 

Lemma 3.6. The mapping r\ h- > F v (-) from TL + to C(,(.M,R) is Frechet differ- 
entiable at every point rf G TL+ with derivative D v /F given by 



D^Flrf'^m') 



K v ,(m',m")dT]"{m'' 



M 



Moreover, straightforward coupling arguments establish the following bounds, 
where we recall that the constant K is such that \S(g) — S(h)\ < K\\g — /i||was for 
all g,h G Q . 

Lemma 3.7. For any p,p',r) G H+ with p < p' , 

\\D p F[ V ]\\<2K V {M), 



and 



\\D p F[r{\ -D^ML < WK(p'(M)-p(M))r)(M). 
3.4. Existence of equilibria via perturbation 

We define a family of dynamical systems for u G R+ by 



(3.9) 



(u) (u) 
Pi = Po 



utv- \ Dp^ds. 



That is, we apply the equation (2.8 1 with the mutation measure v replaced by the 
multiple uv. 

Theorem 3.8. Suppose the selective cost of a nonzero genotype is bounded 
below. That is, 

inf{5(<5 m ) :meM} = mi{S(g) : g G Q, g ^ 0} > 0. 

Then, there exists U > such that there is a finite equilibrium for the equation 
(3.9 1 for all u G [0, U]. That is, there exists a measure p^ G TL + such that 



uv = F n 



P 



(«) 



(u) 

Proof. If it exists, pi has a density with respect to v, which we write as 
p {u) eC b (M,R+). The equilibr ium equation then becomes 

(3.10) F pW ■ pW = u, 

where we adopt the convention that F r for a function r is F p where p(dm) = 
r(rn)v(dni). The theorem is proved if we can show that there is a differentiable 
family of functions that satisfy p^ — and ( 3.10 ) in a neighborhood of u = 0. 



Differentiating with respect to u and applying Lemma |3.6| we get 
(3.11) 



K pM (to', to") ^—{m") dv{m") 



M 



du 



du 



For each p G Cb(M, K) we may define the bounded linear operator T p : Cb(M,l 
C b (M,R) by 



(3.12) T p (q) := 



M 



K p+ (rri, to") q(m") dv{m") 



p + (m')+F p+ (m')q(m'), 



where p + G C(,(A4,M+) is defined by p+(m) — (p(m)) + , m G M. Let V be the 
subset of p G Cb(M,R) such that T p is invertiblc. It follows that OgP, because 
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T (q)(m') — S(8 m ')q{m!) and inf{5(<5 TO ) : m € M} > by assumption. A standard 
result in operator theory (see Lemma VII. 6.1 of [DS88]) tells us that the invertible 
operators form an open set in the operator norm topology, so that T> includes all p 
such that 1 1 T p — Tq| is sufficiently small, where || ■ || denotes the operator norm. By 



Lemma 2.8 we can bound \\T p — Tq\\ by 10-ftTHpUoo, from which we conclude that T> 
includes an open ball around 0. 

Define a map L : V -> C b (M, K+) by 

L(p) -T-^l), 

where 1 € Cb(M,M. + ) is the function with constant value 1. The map L is contin- 
uous, and 

= T p (L(p))-T o (L(0)) 
= (T + (T p - T o ))(T(0) + (L(p) - T(0))) - T o (T(0)) 
= (T p - Tq)T(O) + T (L(p) - L(0)) + (T p - T a )(L(p) - L(0)). 



Thus, 



L(p) = L(0) - T^fa - To)L(O) - (T p - T )(T(p) - T(0))). 



Since T(0)(m) is bounded away from 0, it follows that there is a neighborhood V of 
such that < inf pe v v^-meM L{p)(m). Hence, by standard results on existence 
and uniqueness of solutions to ordinary differential equations in a Banach space, 
the ordinary differential equation dp du = L(p^) with initial condition p^ = has 
a solution on an interval [0, U] and this solution takes values in Cb(A4,R + ). Thus 



(p^ u ')o<u<u satisfies (3.11) and hence also (3.101. □ 



3.5. Concave selective costs 

For an important class of examples — including the demographic example of 
Section [L4| — the selective costs are concave, in the sense that the marginal cost 
of adding a given mutation becomes smaller, the more other mutations are already 
present. Formally, this is stated in Definition |3.9| Under a few mild constraints, we 
show in Theorem 3.11 that concave selective costs yield monotonic solutions (pt)t>o 



when started from the pure wild type population 0, and hence such systems must 
either diverge to a measure with infinite total mass or converge to an element of 7i+. 



Corollary 3.14 gives a further condition that is sufficient to ensure the limit is an 
element of 7i+ . Our conditions for monotone increase in p over time turn out to be 
satisfied quite generally for the applications we have investigated. The conditions 
for the existence of a limit in 7i + , on the other hand, are not always satisfied, 
and there are important cases (discussed in [WES08J) for which p t increases to a 
measure with infinite total mass. 

Definition 3.9. A selective cost function S is concave if 
(3.13) S(g + h + k) - S(g + h) < S(g + k) - S(g) for all g, h, k S Q. 

Lemma 3.10. A selective cost S is concave if and only if 

S(g + S m + S m >) - S(g + S m ) < S(g + 5 m >) - S(g) 
for all g £ Q and to, m! € M.. 
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Proof. Since elements of Q have finite integer mass, we can pro ve (|3 . 13 > by 
induction on n :— max{h(Ai) , k(Ai)}. Our assumption is equivalent to ( |3.13[ ) in the 
case n = 1. Assume now that (3.131 holds whenever ma,x{h(A4), k(A4)} < n — 1. 



Suppose h(M) = n and k(M.) < n — 1. Let to be in the support of /i, and let 
h = h + S m . Then 



S(g+h + k)- S(g + h)- S{g + k) + S(g) 

= S(g+ k + h + S m ) - S(g + h + 5 m ) - S(g + k) + S(g) 
= [S(g + 5 m + k + h)- S(g + 5 m +h)- S(g + S m + k) + S{g + S m )] 
+ [S(g + S m + k)- S(g + S m ) ~ S(g + k) + S(g)] . 

Since h and k both have mass smaller than n, each of the terms in brackets is < 
by the induction hypothesis. To complete the induction, we need only address the 
case when h(Ai) = k(A4) — n; this proceeds exactly as above. □ 



Theorem 3.11. Fix a mutation measure v 6 Ti + and a selective cost S : Q 
* + , that satisfies the conditions 

• 5(0) = 0, 

• S(g) < S(g + h) for all g,heQ, 

• for some constant K , \S(g) — S(h)\ < K\\g — ^|| Was ; f or a H g,h 6 Q , 

• sup geg S(g) < oo, 

• S(g + h + k)~ S(g + h) < S(h + k) - S(h) for all g, h,keg. 



If po > (< 0), then the solution of equation (2.8) guaranteed by Theorem 2.4 
satisfies p s < p t (p s > Pt) for all < s < t < oo. 

PROOF. Recall that F v {m) = E[S(X^ + S m ) - S{X^)] for rj E H+ and m e M. 
It is clear from the assumptions that n i— > F v (-) is a map from H+ to Cb(M,R). 
The curve p is differcntiable at each t > and satisfies 



(3.14) 



Pt=v-F pt - p t . 



The right-hand side is continuous in t. By Lemma 3.3 it then suffices to show that 
p t > for all t > 0. 



with 



and, by Lemma |3. 5 



Define 



By Lemma 3.4 and Lemma 3.6 the curve t *— > F pt , t E R+, is differentiable, 



-F pt = D Pt F[ptl 



Pt 



dt 



Pi 



(D Pt F[pt])-pt-F p 



Pt- 



It ■-- 



- exp 



F Ps ds r Pt 
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Then, 



(3.15) 



djt 
dt 



cxpjy F Pc ds^ (F Pt p t + p t ) 

exp U F Ps ds} (F PtPt - (p pt F[ P t]) ■ Pt - F pt 



■ Pt 



exp 



F Pa ds\(-D Pt F 



exp \ - I F Ps ds )■ ■ -ft 



■ Pt- 



Suppose now that 70 = — f>o < 0. For any Borel set B C J\4 
lt{B) <7t(B)- 70(B) 



-K Ps (m', m") exp <^ - / F Pr (m") dr \ d-y s {m") 
% jb l Jm I Jo 



F Pr (rn!) dr > dp s (m')j ds. 



x exp 

Put fit '■— sup^,-^ 7 t (B), where the supremum is taken over Borel sets. Because 
—K and F are both nonnegative and bounded and since p s (M) < sv(M), for any 
positive T there is a positive Ct such that 

Pt<Qr I Psds 



for < t < T. By Gronwall's Inequality [SY02, D.l], this implies that ft < for 
all t. It follows that the measure 7t is nonpositive. Thus, pt (which differs from j t 
by a strictly negative Radon-Nikodym factor) is nonnegative. 

If Po 5: 0, then we define 74 to be +exp | J * i ;l p s ds| pt, and then the rest of the 
proof carries through as above. □ 



Corollary 3.12. Suppose the conditions of Theorem \3.11\ hold, and there 
exists p** G TL + satisfying the equilibrium condition 

v{dm) — E [S(X P ** + S m ) - S(X p *')} p^(dm) 



for equation (2.8 1. For the process started at po = 0, pt T P* € 7~L+, where p* < 



and p* is also an equilibrium for (2.8 I 



The proof of the following comparison result is straightforward and is left to 
the reader. 

Lemma 3.13. Consider two selective cost functions S' and S" that satisfy the 



conditions of Theorem 3.11 Let p' and p" be the corresponding solutions of (2.8 1 



Suppose that S'(g + S m ) — S'(g) < S"{g + S rn ) — S"{g) for all g S Q and m g M. 
and that p Q > p '. Then, p' t > p" for all t > 0. 

If the conditions of Theorem |3. 1 1| hold, trajectories starting from will either 
converge as time goes to infinity to an equilibrium state in H+ or diverge to a 
measure with infinite total mass. We therefore wish to consider conditions that will 
ensure the existence of an equilibrium with finite total mass. One approach is to 
compare the concave selective cost to a multiplicative selective cost. This produces 
the small benefit over the general existence result of Theorem |3.8| of providing an 
explicit value of v(M) that is "small enough" to guarantee the existence of finite 
equilibria. 
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Corollary 3.14. Suppose that the selective cost S : Q — ► K satisfies the con- 
rim") dg(m") 



ditions of Theorem \3 . 1 1\ and also satisfies the bound 

(3.16) S(g + 5 m> ) - S{g) > £ [1 - exp(-r(m'))] exp 

for all m! g M. for some constant £ > and function r : M. 

1 



M 



1 



dv(m) < oo. 



iM 1 ^ exp(-r(m)) 

Suppose also that po is the null measure and v{M) < e -1 ^. 
p* G 7i+ sttc/i £/iaf pt]p* as t — ► oo. 

On £/ie other hand, if the reverse inequality 



such that 



Then, there exists 



(3.17) 5( 9 + <W) - %) < £ [1 - exp(-r(m'))] exp 
holds and v(M.) > e -1 ^, then lim^oo p t (M) = oo. 



r(m") dg(m") 



M 



PROOF. Suppose first that (3.16) holds. Let p" be the solution of 2.8 with 
selective cost S"{g) = £ \ M (\ — exp(— r(m)) dg(m) and initial condition p'^ = 
where 

ia \ exp(c) 

p^(dm) = — — v{dm) 

1 — exp(— T(m)) 

with c£ = exp(c)z^(A^) (such a c exists by the assumption that v(JA) < e^ 1 ^). It 
follows from the results of Section 3.2 that p" = p** for all t > 0. 

Apply Lemma 3.13 with S' — S and and /Jq = to conclude that p t < p** for 
all < > 0. It follows from Theorem 3.11 that pt T p* as i — > oo for some p* G 7i + 
with p* < p**. 

Now suppose instead that the upper bound (3.17) holds. We define p" as before, 
with selective cost S" , but with initial condition p^ = 0. Lemma 3.13 implies then 
that pt > p'/ for all t > 0. We know that p" is increasing in t, and there is no 
finite equilibrium. Suppose R := lim t _»oo pt(-M) < oo. Then for any Borel set A, 
the quantity p'({A) is increasing in t and bounded by R, so it converges to a limit 
p"(A). It is easy to check that A t— > p"(v4) is a measure and that p" converges 
to p" in the Wasserstein metric (that is, in the topology of weak convergence). 
From equation (2.8 1 we know that for any Borel set A, and any s < t, 

< p'l{A) - p" s (A) = l\v{A) - D"pZ(A)) du, 

J s 

where the operator D" is the analogue of the operator D when the selective cost S 
is replaced by the selective cost S". We conclude that v(A) — D"p"(A) > for all 
u > 0, and so 



\p'i{A)-p'i{A)\ 



{v{A)-D"p'i{A))du 



| as t 



oo. 



Since D" p is continuous in p by Lemma |2.5| it follows that the integrand on the 
right-hand side converges to v{A) — D"p"(A), which must then be 0. Since this 
is true for all Borel sets A, it would follow that p" would be an equilibrium for 
the dynamical system with selective cost S" , contradicting the fact that no such 
equilibrium exists. □ 
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The monotone growth of systems with concave fitness cost functions allows us 
to derive a simple sufficient condition for stability of the "minimum equilibrium" 
p* — the fixed point to which the process converges when started from 0. We 
know that all trajectories that start strictly below p* converge asymptotically to 
p*. Since the vector field vanishes at p*, this implies that the derivative of the 
vector field — D Pt F[n] ■ p* — F Pr ■ r\ is nonpositive for all positive directions r\. The 
equilibrium is box stable if this nonpositivity extends to a neighborhood of p* , which 
can be guaranteed if the derivative is actually bounded away from 0, measured by 
its Radon-Nikodym derivative with respect to v. 

Theorem 3.15. (i) Suppose the selective cost function satisfies the conditions 



of Theorem 3.11 and the curve (pt)t>o> started at po — 0, converges to a finite 



fixed point p* . Suppose further that 

k := inf D p F\v]{m) d ^(m) + F p (m) > 0. 
meM dv 

Then the fixed point is box stable. 

(ii) Moreover, if M. is compact and the equation 

D p „F[r)}-p* + F Pr - V = 

has no solution rj 6 H+ that is absolutely continuous with respect to v, then p* is 
box attractive. 

Proof. Consider part (i). Let tp s := p* + sv, and let 4> s = v — F^ s •ip 3) be the 
driving vector field evaluated at the point ip s . Applying Lemma |3.7[ we see that 

inf (DpF[v](m)^-(m) + FJm) 
meM \ dv i 



> k — lOKsv(M) - l&Ksv(M) 2 ( sup ^(m) + s] 

\ m 'eM dv J 

For positive s small enough this bound is positive. Then, 0o — and 



,-dr 

(-D^ r F[v]-F^-v)dr 
< for s sufficiently small. 



Applying Theorem 3.11 we see that when po — tp s , the trajectory p t is monotoni- 
cally decreasing. Thus, for all times t > we have p t € B(p*,ip 3 ). 

Now consider part (ii). Suppose that there is no rj G H+ that is absolutely 
continuous with respect to v and satisfies D Pt F[rj\ ■ p* + F Pt ■ n = 0. For all s 
sufficiently small, the trajectory starting from tp s is monotonically decreasing, and 

(s) 

converges to an equilibrium p\ . If p* were not box attractive, then these equilibria 
would be distinct from p* = p^\ Since the pi^ are all between p* and p* + sv, 

(s) . . r~\ 

we know that p\ — p* is absolutely continuous with respect to v. Consider the 
measures 7r( s ), defined by 
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These arc probability measures on M.. Since M. is compact, the space of probability 
measures on M. is also compact (recall that the Wasserstein norm induces the 
topology of weak convergence), so there is an accumulation point. Let Si, S2, ■ ■ ■ 
be a sequence converging to u, with n 1 - 11 ^ — > 77. Since the vector field vanishes at 
all of these points, the derivative in direction r\ vanishes as well, contradicting our 
assumption, and hence proving that is, in fact, box attractive. □ 

3.6. Iterative computation of the minimal equilibrium 



The measure tt g Ti + is an equilibrium for the dynamic equation ( 2.8 ) if -tt — 



v, where we recall that F n ■ tt is the measure that has Radon- Nikodym derivative 
F„ with respect to tt. This implies that tt is absolutely continuous with respect to 
v with a Radon-Nikodym derivative p that satisfies 

(3.18) F p (m)p(m) = 1, me M, 

where, with a slight abuse of notation, we write F q for F K when k £ 7i + is a measure 
that has Radon-Nikodym derivative q with respect to v. Under the conditions of 



Theorem 3.11 if (3.181 has a solution, then it has a minimal solution that arises 

p t when p is the 

0. 



as the Radon-Nikodym derivative with respect to v of lim t _ 
with po 



solution of (2 



If one is only interested in finding the minimal equilibrium numerically, then it 

An obvious 



would be desirable to be able to do so without having to solve (2 



M 



approach to that problem is to define a sequence of functions p n 
inductively by pq — and 

Pn+i = -s— > n>0. 

It is clear that if S is concave, then F p n < F p > for p' < p" . Because po — 
< Pi, it follows that po < pi < ■ ■ ■■ Moreover, if p** is a solution of (3.18) such 
that f M p**(m)v(dm) < 00, then p n | p, < p** as n — > 00 for some function 
p* : M. — > H.+ and the measure • v is the minimal equilibrium of the dynamical 



equation (2.8 1 



Remark 3.16. If tt e TL+ is an equilibrium for (2.8 1 with Radon-Nikodym 

1 



derivative p against v, then, from (3.181 
tt(A4) = / p(m)dv(m) 

J M 

> 



dv(m) 



M F p( m ) 
1 f 1 

M sn Pg£gb{g + o m )- b{g) J M S{d m ) 

under the concavity assumption. Thus, a necessary condition for the existence of 
an equilibrium in Ti + is that the last integral is finite. 

3.7. Stable equilibria in the concave setting via perturbation 



Suppose that the selective cost is concave. In Section [3^4] we constructed equi- 
libria for sufficiently small mutation measures. We know from Section |3.5| that all 
trajectories starting below an equilibrium p converge asymptotically to an equilib- 
rium that is also dominated by p. This leaves open the question of whether the 
equilibrium constructed by perturbing the dynamical system away from v = is 
the same as the minimal equilibrium p* to which the system converges when started 
from 0. 
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Theorem 3.17. (i) Suppose the selective cost function satisfies the conditions 
of Theorem 3.11 For U > sufficiently small, there is a unique p : [0,U] — > 
C b (M,\ 



M 



solving the equation 
Kpi-u) (m', m") p( u \m") dv(m") 



p {u] \m!) + F „ w (»n')pW(m') = 1, 



with p^ = 0. The measure p^ u 'v G TL+ is the minimal equilibrium for the system 
with mutation measure uv for all u £ [0, U]. Furthermore, the minimal equilibria 
so realized for u < U are box stable. 

(ii) Moreover, if M is compact and the equation 

D Pt F[rf\-p* + F p ,-r 1 = Q 

has no solution r\ £ 7i+ with rj absolutely continuous with respect to v, then p* is 
box attractive. 

Proof. Following the method of Section |3.6| we may compute the minimal 
equilibrium as the limit of the iteration 

u 

Pn+l = p ! 

with po = 0. Let q n be the corresponding iterates for vv, where < u < v. 
The concavity of S implies that p n < q n for all n and 

v u 
ln+1 — Pn+l = p 

_ vF p n ~ uF q n 



F F 



(v-u)F Pn u(F Pn -F q J 



F F 



F F 



- q n ^ Pn-L q n 

< a\v -u\ + b\\q n - p n \\oc 

for appropriate constants a and b, if p n and q n both converge to finite equilibria, 



by Lemma 2. 
Thus, 

\\q„+i -Pn+l||oo < a\v -u\ + b\\q n - p„\\oo- 
Moreover, we can make the constant b less than 1 by taking u and v sufficiently 
small. Iterating, we see that 

lkn+i -Pn+illoo < c\v -u\+ b n \\q -_polU, 

where c = a/(X — b). Hence, the corresponding minimal equilibria, say and q*, 
satisfy 

II?* -P*||oo < C\V - u\. 

Therefore, if we write u i— > for the curve of densities of minimal equilibria, 
then 

< p {v) (to) - p [u) (to) < c(v - u) 

for all to £ M. for all < u < v sufficiently small. It follows that u i— > p( u \m) 
is Lebesgue almost everywhere differcntiable, and is the integral of its derivative. 
Since p^ satisfies the relation p( u >F p M — u for every u, we see by differentiating 
with respect to u that p^> satisfies the differential equation in the statement. By 
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standard uniqueness results for ordinary differential equations, it is the unique 
solution. 

Fix any u € [0, U). The dynamical system with mutation measure Uv converges 
to a finite equilibrium p^v £ 7i + , where p^ u > > pW. Let (pt)t>o be the dynamical 
system with mutation measure Uv started from po — p^ ■ v and let (p' t )t>o be the 
reverse - the dynamical system with mutation measure uv started from p' a — p^ -v. 
Thus, (pt)t>o starts below its equilibrium, and (p' t )t>o starts above its equilibrium. 
We have 

p' =uv- F p( u)p {u) ■v=(u-U)v<0, 
po = uv - F p{u) p (u) ■v=(U-u)v>0. 



Therefore, p^ > p^ u \ and we can conclude from Theorem 3.11 that the measure 
p t stays bounded between the measures p^ ■ v and for all times t, and hence 

the minimal equilibrium p( u > ■ v is box stable. 

Now consider part (ii). We know that the system with mutation measure uv 
converges monotonically downward to an equilibrium when started from p^v. The 
final part of the proof, showing that this equilibrium is in fact ■ v, proceeds 
exactly as in the previous proof of Theorem |3.15| □ 

3.8. Equilibria for demographic selective costs 

As an example of Corollary |3.14| suppose that S is the demographic selective 
cost of Section 11.41 so that 



S(g + <V) - S(g) = J™ (l - e " e (™») f x exp (-Ax - J 9(m" ', x) dg(m")^ dx. 
This selection cost is concave, allowing us to apply the results of Sections |3.5| and 



3.7 If sup TO x 9(m,x) < oo and infminf^gs 9(m,x) > for some set B such that 

hold, with the function t a 



3.14 



j B f x dx > 0, then the conditions of Corollary 
suitable constant and the constant £ sufficiently small. In other words, if there is 
a range of fertile ages over which all deleterious mutations reduce survival by at 
least some minimal amount, then selection keeps the total intensity from going to 
infinity provided v(A4) < e _1 £. 

On the other hand, when sup m x 9(m, x) < oo holds (the lower bound is then 
irrelevant) there is also a (larger) constant t and £ sufficiently large such that the 



reverse inequality (3.171 holds. We then conclude that there is no convergence when 
v(M) is too large. 

For example, suppose that S is the demographic selective cost of Section |F~4] 
with M. = [a,0] for < a < (3 < oo, v is a constant multiple of Lebesgue measure 
on M., f x is constant, and 9(m,x) = rjl^ m ^(x) for some constant rj. Such a 
simplified model has featureless fertility between two ages that represent the onset 
and end of reproduction, mutations associated with effects at specific ages, constant 
mutation rate during the reproductive span, and equal increments to the hazard 
from all mutations. Then, for a suitable constant £, 

S(g + 5 m ) - S(g) < ( (l - e^) [exp (-Am) - exp (-A/3)] < ((m - /?). 

Because 

dm = oo, 

m — p 
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an equilibrium with finite total mass does not exist. Of course, (3.f8) may have a 
solution p such that p ■ v has infinite total mass and so does not belong to 7i + . In 
the setting of Theorem |3.11[ the measure ir = p ■ v could still arise as the increasing 
limit of the solution p t to the dynamic equation (2.8) and be such that population 
average selective cost E[S'(X 71 ')] would be finite. These more delicate possibilities 

are treated in |WES08j . 

Observe, now, that Theorem |3. 15| immediately tells us that the minimal equi- 
librium is box stable when v is sufficiently small. In order to show that it is box 
attractive, though, we need to apply Theorem 3.15 which requires that we show 
the derivative to be nonsingular in a neighborhood of the equilibrium. An equilib- 
rium 7r for the demographic selective cost is a measure n(dm) — p(m)v(dm) where 
p satisfies 



Im 



■>(,. 



^)p{m")dv(m") dx 



p(m) = 1. 



If we differentiate the left-hand side at p in the direction of the function q, 
(giving us the derivative of the vector field) we obtain 

(3.19) ^ 

- J (l~e- e ( m '^)f x exp(-\x- f {l-e- e{ * m "^)p{m")dv{m") 



M 



x ( - / (1 

J M 



j-flCm'",*)) ? ( m ///) dv(m"'yj dx 



p(m!) 



(1 



_ -6{m',x)\ 



)f x exp -Xx - / (1 



_ -6(m",x) 



) p(m") dv(m") I dx 



M 



q(m!) 



1 

p{m') 



p{m') 2 f (l-e- e( - m '<^)f x exp(-\x- [ (1 - e - e (™'») p{ m ") dv{m") 
Ja \ Jm 



(1 _ e -fl(m'V)) g( m '") dv{m"') j dx - q(m') 



M 



As we have already observed, if sup m x 9(m, x) < oo and inf m inf^g b 0{m, x) > 
for some set B such that J B f x dx > 0, then 

S(g + 5 m ,) - S(g) > £ [1 - exp(-r)] exp (-rg(M)) 

for all m! e M. for suitable constants ^ > and r > 0. If v(M) < e _1 £, then there 
is a minimal equilibrium n with 

- i/(M)(l-exp(-r)) ' 
where 7(2) is the solution of — exp(c)z for z < e -1 £. Hence ir — p ■ v with 



sup p(m) < ..... . ... 

meM v(M){l -exp(-r)) 



Observe that 



lim 



so we can vary v (but keep the selective cost fixed) and conclude that 
sup{p(m) : m g M, v(M) < e -1 ^} =: p < 00. 
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The suprcmum over m! € M. of the quantity in brackets on the right-hand side 



of (3.19) is at most 



sup q(m) 



-l+v{M)F f x e 



- \x 



dx 



which is negative for sufficiently small. Thus, Theorem 3.15 implies that the 

minimal equilibrium is box attractive when v(M.) is sufficiently small. 



CHAPTER 4 



Mutation-selection-recombination in discrete time 



We devote the remainder of this work to establishing rigorously the claim that 
we argued heuristically in the Introduction: that the dynamical system (1.5 1, which 
we defined more formally in (2.8), should be the limit of a sequence of infinite 
population mutation-selection-recombination models in the standard asymptotic 
regime where selection and mutation are weak relative to recombination and both 
scale at the same infinitesimal rate in the limit. 

More specifically we show that our continuous time model is indeed a limit of 
the sort of discrete generation, infinite population mutation-selection-recombination 
models considered in [BT91, KJB02], once such models have been extended to 
incorporate our more general definition of genotypes. Such a result not only justifies 
our model as a tractable approximation to more familiar models in the literature, 
but, as we remarked in the Introduction, it also marks out the range of relative 
strengths for mutation, selection, and recombination where this approximation can 
be expected to be satisfactory, and thus where the resulting conclusions that underly 
applications to the demographic study of longevity can be trusted. 



4.1. Mutation and selection in discrete-time 

Consider first the situation without recombination. 

Recall that the distribution of genotypes in a population is described by a 
probability measure on Q, the space of finite integer- valued measures on the set M. 
of loci. 

We consider a sequence of models indexed by the positive integers. In the 
n th model each new birth gets a random set of extra mutations away from the 
ancestral wild type added to those it inherited from its parents. The loci at which 
these mutations occur are distributed as the Poisson random measure X v / n with 
intensity v/n. The fitness of a genotype g 6 Q is e^ s ^ 9 ^ n . We speed up time by a 
factor of n so that n generations pass in 1 unit of time. The net result is that in a 
finite, non-zero amount of time the population is asymptotically subject to finite, 
non-zero "amounts" of mutation and selection. 

We now define operators 9Jt„ and 6 n acting on the space of probabilities on 
Q that describe the transformation in genotype distribution by, respectively, one 
round of mutation and one round of selection. The intuition behind these definitions 
is described in the Introduction. 

Notation 4.1. Given a probability measure P on Q, define new probability 
measures 9Jl n P and & n P by 

(4.1) Tt n P[F]:= [ ¥.[F{g + X u / n )]dP{g) 

JQ 



:•><) 
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and 

r e - s ^)/ n F{g) dP{g) 
(4-2) & n P[F] := g . w , 

for a Borel function F : Q — > K+. 

Note that when n is large, 

¥>{X"/ n = 0} = e*p(-i/(.M)/n) sa 1 - v{M)/n, 

and conditional on the event {X v l n ^ 0} the random measure X v l n is approxi- 
mately a unit point mass with location in M. chosen according to the probability 
measure u(-)/u(A4). Hence, for any probability measure P on Q 



(4.3) lim n(Tl n P[F} - P[F}) = [ ( [ 



F{g + 5 m )-F{g)v{dm)\ dP{g). 

M J 

In particular, if F is of the form F(g) := e~ 9 ^ for some Borel function / : M — ► ! 
then 

(4.4) Wl n P[F] =p\f- e v[e ~ f - x]/r ' 
and so 

(4.5) lim n(M n P[F] - P[F]) = vie' 1 - l]P[F}. 

n — *oo 

Note also that e~ s ^ 9 ^ n w 1 — S(g)/n when n is large and so, when P[S] is finite, 

(4.6) lim n(& n P[F] - P[F]) = P[S ■ F] - P[S]P[F]. 

n — >oo 

When we start with a population genotype distribution P, the population geno- 
type distribution after one generation of mutation and selection is dJl n & n P (assum- 
ing that selection precedes mutation). A trajectory of the resulting discrete-time 
model is defined by iteration: Given an initial population genotype distribution P, 
the population genotype distribution after m generations is (97l„6„) P. 

If, after speeding up time in the n th model, the resulting sequence of trajectories 
has a continuously differcntiable limit P t , this limit should satisfy the equation 

lime- 1 (P t+£ [F] - P t [F]) = lim n(M n & n P[F] - P[F\) 

ej.0 n— too 

= lim n(e n P[F] - P[F}) + lim n{M n & n P[F] - & n P[F]) 

n — >oo n — *oo 

= P[SF] - P[S]P[F] + v[e- } - \\P\F\ 

for test functions F : Q — > R + of the form F(g) = e~ 9 ^ for some Borel function 
/ : A4 — > M_|_. For this special choice of test function, this is precisely the dynamical 
equation defining the recombination-free process that we introduced in jSEW05 
and derived heuristically in equation (1.4 1. In fact, these test functions are enough 
to consider, since they determine probability distributions on Q. Formally, a proof 
that the discrete-time process converges to this continuous-time process would re- 
quire that we prove the existence of the continuously differcntiable limit, a fact that 
we assumed above. 
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4.2. Recombination in discrete-time 

We now introduce recombination. Imitating [B T91j . we think of a recom- 
bination event as taking two genotypes g',g" € Q from the population and re- 
placing the genotype g' in the population by the genotype g defined by g{A) := 
g'(A fl R) + g"(A n R c ), where R C M. is the particular "segregating set" for the 
recombination event. That is, the new individual with genotype g has the same 
accumulated mutations as the individual with genotype g' (respectively, g") for 
"loci" in the set R (respectively, R c ). (Recall that we work in an abstract frame- 
work in which loci are just places at which mutations from wild type can occur 
rather than concrete physical loci per se). From a biological point of view, the 
recombination process would more legitimately be defined on a linear sequence of 
loci, as in [BT9T], so that one can picture recombination caused by crossovers dur- 
ing meiosis. Working with a somewhat abstract space of loci is perhaps a weakness, 
but also a useful simplification, of our approach.) 

We will, of course, think of g' and g" as being chosen independently at random 
according to the particular probability measure describing the distribution of geno- 
types in the population. We will also imagine that the segregating set is chosen at 
random via some suitable mechanism. In order to discuss random sets rigorously, 
we follow formalism described in [ Ken74] and define a u-algebra on sets of Borel 
subsets of M. by the requirement that all incidence functions with Borel subsets are 
measurable. A consequence of this definition is that if S is a random Borel set and 
k is a finite measure, then is a real-valued random variable. We suppose there 
is a probability distribution TZ that describes the distribution of the random set 
of loci that segregate together. We will always assume, without loss of generality, 
that TZ is symmetric in the sense that 

(4.7) K(A) = TZ({R C : Re A}), 

where R c denotes the complement of the set R. 

Notation 4.2. For any Borel measure g on M. and Borel subset R of A4, 
define the Borel measure gn on M. by 

g R (A) :=g(AnR) 

for Borel subsets AcM. Given the distribution TZ of a random subset of M. , define 
the corresponding recombination operator that maps the space of Borel probability 
measures on Q into itself by 

(4.8) mP[f}:= [ [ [ f(g' R + g' R .)dP(g')dP(g")dTZ(R), 

Jb(m) Jq Jq 

where P is a Borel probability measure on Q, f : Q — > K is a bounded Borel function, 
and B(A4) is the collection of Borel subsets of A4. 

Thus, if P describes the distribution of genotypes in the population, then SHP 
describes the distribution of a genotype that is obtained by picking two genotypes 
g' and g" independently according to P and then forming a composite genotype 
that agrees with g' on the set R and g" on the set R c , where the segregating set 
R is picked according to the distribution TZ. Recall that [iP denotes the intensity 
measure of P\ that is, 

(4.9) plP{A) := f g(A)dP(g). 

Jg 
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Remark 4.3. Note that /iO^P = /iP; that is, the intensity measure of the point 
process describing the genotype of a randomly chosen individual is left unchanged 
by recombination. Also, note that if P is the distribution of a Poisson random 
measure, then 9\P = P. 

4.3. The discrete-time model 

Notation 4.4. Given a probability measure P on Q, set Q k := (DVXfl n & n ) k Po- 
That is, if Po describes the population at generation 0, then Qk describes the 
population at generation fc, where we suppose that each generation is produced from 
the previous one by the successive action of selection, mutation, and recombination. 



CHAPTER 5 



Hypotheses and statement of the convergence 

result 



5.1. Shattering of point processes 

We expect recombination to break up dependencies between different parts of 
the genome, so that 9{ k P should be approximately IT^p when k is large, where we 
recall that Tl^p is the distribution of the Poisson random measure with intensity 
[iP. In order that this happens for a given distribution P, it must generically be the 
case that there is a positive probability that the segregating set and its complement 
will both intersect any set with positive /iP mass in two sets that each have positive 
fiP mass. The following condition (with A = ^iP) will be useful for establishing 
quantitative bounds on the rate with which d\ k P converges to H^p. 

Definition 5.1. Given a recombination measure 1Z and a finite measure A on 
M , we say that the pair (1Z, A) is shattering if there is a positive constant a such 
that for any Borel set A, 



\(A) 2 -2 J \(AnR) 2 dU(R) 



\(A) 3 < a 

(5-1) 

= 2 a y \(AnR)\(AnR c )dK(R) 

For example, suppose that N is a finite simple point process on M. = (0, 1]. 
We think of M. in this case as a physical chromosome and N as the set of crossover 
points formed during meiosis. Write < T\ < ... < Tk < 1 for the successive 
points of M. Set T — and Tk+i = 1- Let Z be a {0, l}-valued random variable 
that is independent of N with ¥{Z = 0} = \. Define 1Z to be the distribution of 
the random set given by 

(To,Ti]U(T2,T3]U..., ifZ = 0, 



(T U T 2 ]U{T 3 ,T 4 ]U..., \iZ = \. 

Take A to be any diffuse probability measure (that is, A has no atoms). Suppose 
that there is a constant c such that ¥{N(u, w] = 1} > cA((u, w]) for < u < w < 1. 
This will be the case, for example, if N is Poisson process with intensity measure 
bounded below by a positive multiple of A or N consists of a single point with 
distribution bounded below by a positive multiple of A. Also, for most "reasonable" 
simple point processes with intensity CA for some constant C, it will be the case 
that F{N(u, w] = 1} rj E[iV(u,w]] = CA((u,u>]) when \u — w\ is small, where the 
notation m indicates that the ratio of the two sides is close to 1. 
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Then, 



X(A) 2 -2 J X(AnR) 2 dTZ(R) 

= X{A) 2 -2 l -( [ l A (u)l A (w)P{N((u,w}) =0 mod 2} dX(u) dX(w) 



JO 



2 

l A (u)l A (w)¥{N((u, w]) = 1 mod 2} dA(u) dX(w) 

>[ [ l A (u)l A (w)P{N((u,w}) = 1} dX(u) dX(w) 
J o Jo 

>c/ / lA(w)lA(w)^(( u > w ])^(w)dA(w;). 



/0 JO 

Observe that 



X(A) 3 = 3! / / / l A (u)l A (v)l A (w) dX(u) dX(v) dX(w) 

' {0<u<v<w<l} 



< 3! / / / l A {u)l A (w) dX(u) dX(v) dX(w) 

J J J {0<u<v<w<l} 

= 3! // l A (u)l A (w)X((u,w])dX(u)dX(w) 

J J {0<u<w<l} 

= 3 [ [ l A (u)l A (w)X((u,w])dX(u)dX(w). 
Jo Jo 

Thus, the pair (1Z, X) is shattering with constant a = 3/c. 

Remark 5.2. Note that if the pair (TZ, X) is shattering with some constant a, 



then it follows from (5.1 1 that A ({a;}) = for all x € M., and so the measure A is 
diffuse. 

Remark 5.3. It can be shown that if the pair (TZ, /iP) is shattering and there 
is a constant (3 such that 

J g(A)l {g{A) > 2} dP(g) < (3fiP(A) 2 
for any Borel set A<Z M., then EH fe P converges to H^p as k — > oo. 

5.2. Statement of the main convergence result 



We can now state our main result that justifies the continuous-time model (2.8 ) 
as a limit of discrete-time models in which recombination acts on a faster time scale 
than mutation and selection. We use the notation [a;J to denote the greatest integer 
less than or equal to the real number x. 



Theorem 5.4. Let (pt)t>o be the measure-valued dynamical system of (2.8) 



whose existence is guaranteed by Theorem\2.J\ Suppose in addition to the hypotheses 



of Theorem 2.4 that the pair (TZ, v) (respectively, (TZ, po) ) consisting of the recombi- 
nation measure and the mutation intensity measure (respectively, the recombination 
measure and the initial intensity) is shattering, the initial measure Pq is Poisson 
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(with intensity pq), and the selective cost S is bounded with a — sup geg S(g). Then, 
for any T > ; 

lim sup ||n pt -Q| trl j|| =0. 

™^°°0<t<T was 

This result is proved in Chapter [8] as a consequence of a similar result in Chap- 
ter [6] in which the partially Poissonizing action of recombination is replaced by 
complete Poissonization in each generation. Although it is intuitively reasonable 
that the effect of recombination should be asymptotically indistinguishable from 
that of complete Poissonization when the per-generation effect of selection is suffi- 
ciently small, the technical difficulties are substantial, and we collect some necessary 
preliminary results in Chapter [7] 

Remark 5.5. We have assumed throughout, for notational convenience, a par- 
ticular order of operations: In each generation there is first selection, then muta- 
tion, then recombination. This order has no special significance, and it is easy 
to see that the proofs would hold equally well for another order, or if the same 
total amounts of mutation and selection were split up into multiple bouts within a 
generation, whether before or after recombination. 



Remark 5.6. Because of Remark 5.2 the hypotheses of Theorem |5.4| imply 



that the mutation intensity measure v and the initial intensity po are both diffuse. 
It follows easily from this that each probability measure Qk assigns all of its mass 
to the set of elements of Q that have atoms of mass one; that is, every Qfc is the 
distribution of a simple point process. 

Prom now on, we assume without further comment that all the prob- 
ability measures on Q we consider are distributions of simple point pro- 
cesses. 



CHAPTER 6 



Convergence with complete Poissonization 



Recall from Notation 4.4 that Q k := (DlTl n 6 n ) k P . Theorem 5.4 says that 



under suitable conditions, Q\t n \ is close to H Pt when n is large. The intuition 
behind this result is that %\ drives distributions on Q towards Poisson faster than 
6„ drive ones away. As a first step in the proof of this theorem, we prove a simpler 
result in which the operator £H is replaced by the operator that immediately replaces 
a distribution on Q by the distribution of a Poisson random measure with the same 
intensity measure. This latter proposition is essentially a shadowing result, but none 
of the standard shadowing theorems, such as those in [CKP95J, can be applied as 
stated to this setting. In any case, the direct proof is not difficult. 

Notation 6.1. The complete Poissonization operator <p acts on a probability 
measure P on Q by tyP := H^p. That is, tyP is the distribution of the Poisson 
random measure with the same intensity measure jiP as P. 



Notation 6.2. Set 



Q'k ■■= (ym n 6 n ) k p 



and 



TTfc := pQ'k- 

That is, Q' k the distribution of the Poisson random measure resulting from k itera- 
tions of mutation and selection of intensity 1/n, with complete Poissonization after 
every round, and ~Kk is the corresponding intensity measure. 

Proposition 6.3. There are constants A,B,C, depending on K, v{Ai), and 
pa(M), such that for every T > 

k L *»J - <°*Hwas < e AT (BT + C)n-\ 



sup | 

0<t<T 



In particular, 



lim sup 

n ^°°0<t<T 



n Pt - Q\ 



\tn\ 



= 0. 



Was 



PROOF. For any < s < t, 



Pt 



I Was 



< ll(t-*Hlw„ + 



F Pu p u du 



Was 



The first term on the right is simply (t — s)||i/||was = (t — s)v(M). For the second 



term we use Lemma 2.5 and the triangle inequality, together with the universal 
bound ||pu||was < ||po||was + u|Mlwas, to obtain the bound 

2K I \\Pu\\ Was du<2K{t-s)(p (M)+tv(M)). 
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Putting these bounds together, 

(6.1) \\ Pt - Ps || Was < (t- s)u(M) + 2K(t - s)(p (M) + tv{M)). 

Note that 



(6.2) 

so that 



d(7r ro+ i - u/n) . , E [e- S ^' m +^)/™] 



dit n 



(6.3) 



7T m +l 7T m 1 F* m {') 



E \ e -S(X-m)/n] 



Was 



< n- 2 ■ 2e 2K/n (E[S(X^) 2 ] + K 2 ) exp {E[S(X^)/n}} . 



Next, 



7T m I 1 F„. 

n 



?r m 1 - -F 



P m /r 



Was 



(6.4) 



< - sup I F 7fm ( a: )- F p m/n (ar)| 
8F,, I, 

- ~^~IF«» _ Pm / n II Was 



by Lemma|2.8| and 



(6.5) 



7T m I 1 - -F 



Pm/n I Pm/n 



l - -F 



5: 1 1 I'm Prre/ra| 



Was 



Was 



since F p {x) is always nonnegative. Finally, 



P(m+l)/n Pm/n I 1 F f 



PmA 



Was 



< 



,1/n 

/ ^Pm/nP m / n ^Ps + m/nP s -\- m / n ^ S 

Jo 



Was 



1/n 



< / sup \ F p (x) - F p 

s + m/n ( X ) | || Pm/n+s 1 1 Was ^ S 



zGAI 



(6.6) 



,1/n 

+ SU P F P m/n ( x ) / 1 1 Pm/n - Ps+m/n 
x£M Jo 



I Was 



< 



1/n 



8Fs 



m + 1 



i/(A4) + po(M) 



F 



,1/n 




i 


/O 







Was 



1\ m+1 



< Fn~ 2 f 4+ i ) '^^(u(M) +Po(M)) +Kv{M)n- 2 . 



2) n 

Thus, there are constants a and b such that 



Hence, 



ii i ^ a n n bT + c 

IFm+l — P(m+l)/n||Was < — ||7r m — /0 m /n||Was H 2 — ' 



I 11 ^ -1 aT bT + C 
Fm _ Pm/n II Was S Tl e . 
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Combining this inequality with the inequality (6.1), the proposition follows imme- 
diately □ 



CHAPTER 7 



Preparatory lemmas for the main convergence 

result 



We introduced the recombination probability measure 1Z as the distribution of a 
random subset of M. with the property that the subset and its complement have the 
same distribution, and we think of the subset and its complement as segregating set 
of loci. These two sets form a 'partition of M. and, with a slight abuse of notation, 
we will use also use the notation 1Z for the distribution of this random partition 
of M. into two sets. In the same vein, we define the probability measure IZk on 
partitions of M. to be the distribution of the coarsest random partition that is finer 
than each of the independent partitions {Rj,R^}, 1 < j < k, where each partition 
{Rj, Rj} has distribution 1Z. That is, TZk is the distribution of the partition of M. 
consisting of the non-empty sets of the form R\ n • • • Rk, where Rj is either Rj or 
Rj. Using standard notation, we write this partition as {R%, R°} A • • • A {Rk, R%}- 
Note that the number of sets in this partition is at most 2 fc , but may be smaller. 

With this notation, the product of k copies of the recombination operator 91 
can be written as 



where, for a finite partition A = {A±, . . . ,Ak} of M into Borel sets, the annealed 
recombination operator 9^4 is the operator on probability measure on Q defined by 



for a probability measure Q on Q and a function F : Q — > R. The annealed 
recombination operator should be thought of as representing the combined effect of 
some number k of successive recombination events involving specified segregating 
sets Ri,...,R k such that A= {R 1 , R\] A ■ • • A {R k , R%}. Note that if A and A' 
are partitions, then ^a^A' = V^AaA'- 

Given some probability measure P on Q, the relation ( |7.1| indicates that in 
order for 9\ k P to be approximately Poisson with intensity /iP for large k it will, in 
general, be necessary that a partition of M. distributed according to TZk typically 
consists of sets that all have small [iP mass, so that a sample from 9 c l fc P consists of a 
mosaic of many small pieces each taken from genomes sampled independently from 
the population described by P. The following is a convenient way of measuring the 
extent to which a partition of M is made up of sets that each have small mass with 
respect to some reference measure. 



7.1. Consequences of shattering 



(7.1) 





F( 9 % + ■■■ + : g¥j)dQ(gW)...dQ( 9 W) 
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Notation 7.1. Suppose that A = {A\, . . . , Ak) is a partition of M, and A a 
measure on M.. For r > set 

k 

\A\i X \t) :=^A(A:) r . 

i=l 

Recall the definition of a shattering pair from Definition |5.1| 

Lemma 7.2. // the pair (1Z, A) is shattering with constant a, then for all k > 1, 



J \A\ { 2 X) dK k (A) < 



k + 1' 
where 

a* = X(M) 2 V 2a\(M). 

PROOF. Define a sequence of random partitions ,4o,,4i, ■ ■ • of M. by setting 
Aq = {M} and Ak = {Ri,R°} A ■ • • A {R k ,R c k } for k > 1, where the partitions 
j > 1, are independent and identically distributed with common distri- 
bution ft. Set X k = \A k \ { 2 ] . Then, 



J \A\ {X) dK k {A) =E[X k ]. 



By the symmetry of 1Z, 



E [Xfc+i | A, ■ ■ • , A k ] = f I X(A n R) 2 + X(A n R c ) 2 j dTZ(R) 

= 2 I KA^R) 2 dR{R) 
AeA k J 

< Y HA) 2 (l-aX(A)) 

A&A k 

= \A k \i X) -a\A k \i X) . 
It is always the case for any partition A of M. that 

> (i^iiTWV = hm)- 1 ^) 2 . 

Thus, setting c = 2/aX(M), 

E[X k+1 \A!,...,A k ] <X k {l-cX k ). 

Applying Jensen's inequality to the concave function x(l — cx), we see that 

(7.3) E[X k+1 ] < E[X k (l - cX k )] < (E[X k ])(l - cE[X k }). 

We complete the proof by induction. We have X = X(M) 2 . Suppose that 
E[X k ] < a*/(k + 1). Since a* > 1/c, 

Ct* / COL 

E[^ + i]<r^T 1- 



k+ 1 V k+1 
a* k 
~ k+1 ' k+1 

a* 

< 



k + 2' 

□ 
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The following corollary is immediate. 

Corollary 7.3. Suppose that the pairs (TZ,po) and (7?.,^) are both shattering 
with the same constant a. Let v t = tv + po- Then, for all k >\, 



\A\i Vt) dTZ k (A) < 



a 



1 



whe 



a* := {2v t {M) 2 ) V {Aav t [M)). 

7.2. Estimates for Radon-Nikodym derivatives 

Let X be the canonical random variable on Q\ that is, X is defined by X{g) = g. 
Given a probability measure Q on Q, a Borel subset A of Ai, and a measure g £ Q, 
we may define a regular conditional distribution Q[- | = g^jj. 

Recall the definition of the annealed recombination operator 9\ A from (7.2). 

Lemma 7.4. For a finite partition A of M into Borel subsets and a probability 
measure Q on Q , the Radon-Nikodym derivative of the probability measure 9\ A & n Q 
with respect to the probability measure 9\ A Q is given by 

d9\ A 6 n Q 



dSH A Q 



-SQ,A/n 



where 



Sq,a(9) 



E ( lo sQ 



e -S(X)/n | x = 



9A 



logQ 



-S(X)/n 



AeA 



[0,1] 



Proof. Write A = {Ai, . . . ,Ak}- By definition, for any measurable F : Q 
], 

(m A e n Q)[F} = j ■ ■ ■ j f (<#> + • • • + ffg?) d& n Q(g (1) ) ■ ■ ■ d& n Q(g (K) ) 



Q 



-S(X)/n 



K 



^(E*2 



x ex P j" E ■ ■ ■ dQ(s W ) 



-S(X)/n 



F{g) 



' K 

n 



(9UQ)[e" 



,-S(X)/n 



dQ(X X Ai =g A .) dV\ A Q{g) 



-S<3,A/n 



F]. 



a 



Lemma 7.5. For a probability measure Q on Q, a partition A of M. into Borel 
sets, and measures g,g' € Q, 

Sq,a(9) - SqM) < <9{M) + g'{M)). 
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PROOF. We may write 
Sq,a(9) = -n J2 (logQ [ e - s (^)/« | X A = g A ] - logQ [ e - s «/« | X A = 

2 (log Q [e- s W/" | X A = oj - log Q [e"^)/"] ) . 



AeA 

- n 



AeA 

Consider the summand in the first sum corresponding to the set A in the partition 
A. If gA = 0, then the summand is 0. If gA > 0, then the summand is bounded 
above by 

logQ[e- s W/"l{X4 - g A }}+logQ[l{X A = 0}] 

- logQ[e- s ^ n l{X A = 0}] - logQ[l{X4 = g A }\ 

< logQ[e°l{X4 = g A }\ +\ogQ[l{X A = 0}] 

- \ogQ[e-°/ n l{X A = 0}] - logQ[l{XA - 9A }} 

< a/n, 

and, by a similar argument, it is bounded below by —a/n. Thus, the first sum is 
bounded in absolute value by er#{A e A : g(A) > 0} < ag(M). 

Note that Sq : a(9) — Sq,a{9*) is the difference of the first sum and the corre- 
sponding quantity with g replaced by g'; that is, the second sum and its counterpart 
for g' coincide and hence cancel each other in the difference. □ 

For some purposes, it will be useful to keep track of the temporal order in which 
mutations appeared in genotype. We denote by Q* the space of finite sequences of 
genotypes. That is, 

(7.4) g* -=gug 2 u--- , 

where U denotes disjoint union. We think of an element of g % as recording the 
mutations from the ancestral wild type that appear in each of i consecutive gen- 
erations. There arc natural "projection" operations T, : g* ^ g defined for 
g = (gi, . . . ,<72,So) <= (it will be convenient for us later to index sequences 

in reverse order) by 

: = 5o + .92 H h g% 

and 

9j, 0<j<i, 
0, j > i. 



In essence, £ removes the "labels" that record in which generation the various 
mutations from the ancestral wild type occurred, while ^ j isolates mutations that 
occurred in generation j. Note that S = X)j*Lo 

A probability measure Q on g* may be thought of as a sequence of sub- 
probability measures (Q^)f^ , where QW i s the portion of the Q concentrated 
on i + 1-tuplcs. Thus, J2ito Q^iQ t+1 ) = 1- We interpret Q as the distribution of 
a random sequence (X/, . . .,X 2 ,X ), with Q {i \g i+1 ) being the probability of the 
event {I = i} that the sequence has length i + 1 and Q( l \-) / Q^ l \G l+1 ) being the 
joint distribution of (X i7 . . . , X 2 , X ) conditional on this event. 
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Given a probability measure Q on Q*, we define, with a slight abuse of notation, 
probability measures T,Q and ^SfjQ on £ by 



for a Borel function F : Q — > M + . 

We now define analogues 971*, 6*, fR*, and ^P* of the mutation, selection, 
recombination and complete Poissonization operators 9Jt„, 6„, !tH, and with £7 
replaced by Q* . 

Notation 7.6. Consider a probability measure Q on Q* thought of as the 
distribution of a finite sequence (Xj, . . . , X a ) of random measures. 

Define the probability measure ?Bl* n Q on Q* to be the distribution of the fi- 
nite sequence (Xj + i,Xi, . . . ,X ), where Xj + i is a Poisson random measure with 
intensity measure v/n that is independent of (Xj, . . . , Xq). 

Similarly, define a new probability measure S* Q on Q* by setting 



where the segregating set R is a pick from 1Z, J is independent of R and distributed 
as /, and, conditional on J = i, (Xj, . . . , X' ) and (X'J, . . . , Xq) are independent 
picks from the distribution of (Xj 7 . . . , Xq) under Q conditional on I = i. 

The probability measure is the distribution of (Yj, . . . ,Y n ), where J is 

distributed as I and conditional on J = i the random measures Yj, . . . ,Yo are 
independent and Poisson, with the intensity measure associated with the conditional 
distribution of Yfe being the same as the intensity measure associated with the 
conditional distribution of Xk given I = i for < k < i. 

Observe that we have the four intertwining relations 



These equalities confirm that "starred" operators agree with the "unstarred" ones 
once the labeling that identified in which generation a mutation occurred is re- 
moved. 

We will think of our initial condition P as a probability measure on Q* that is 
concentrated on sequences of length one. 

Notation 7.7. Put 



(SQ)[F] := Q[F o £] and (VjQ)[F] := Q[F o 



(6* n Q)^[F}: 



/ gi cxp{-S(S( g ))/n}F(g)dQW( g ) 
f g , C xp{-S(X(9))/n}dQ(g) 

Q[exp{-S(Xj + ■■■ + X^/njFjXj, X )1{I = »}] 
Q[exp{-5(X 7 + --- + X )/n}] 



for a Borel function F : Q* — > R + . 

The probability measure d\* Q is the distribution of 

((x'j) a + (x^ R c,...,(x' ) R + (xS) R c), 



(7.5) 



P k := (Wl n ) k P . 



Set 



(7.6) 



P* k := P„. 
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Note that both probability measures Q* k and P£ are concentrated on sequences 
of length (k + 1). Note also that if (X/,, . . . , X ) is distributed according to P k , then 
Xi, . . . , X k are independent Poisson random measures, each with intensity measure 
v/n, and Xq is independent with distribution Pq. As we expect, Q k — SQ*, and 
P k = SP*, where we recall that Q k = (9Hrt„6„) fc P and P k := (Wl n ) k P . If 
Pq — n po (that is, Pq is the distribution of a Poisson random measure with inten- 
sity measure pPo = Pq), then Pi is the distribution of a (k + l)-tuple (Xk, ■ ■ . , Xq) 
of independent Poisson random measures on Ai , where Xj has intensity measure 
fj&jP? given by pQ when j = Q, and i>/n otherwise. In this case, P k is the distri- 
bution of a Poisson random measure with intensity — v + Oo- 
lemma 7.8. For a positive integer k, and sequence of measures (g k , ■ ■ ■ , go) £ 



dQl 
dPi 



(9k, ■ ■ -,9a) 



exp 



k-1 

E 



S Qi ,Aii9i + ••■ + <*>)> dft(Ai) • • • dK(A k ), 



where the partition Ai in the integral is {Ai + i, A^ +1 } A • • • A {A k , A k } 



PROOF. This follows by repeated application of Lemma [7. 4[ and the fact that 

□ 



91* commutes with 9Jt* . 

Lemma 7.9. Suppose that P 
o~k 



n„ 



exp 



< 



Po . For P k -almost every g £ Q* 
Xg(M) - (e° k ' n - l)pP k (M) 



dQl 
dP* 



(9) 

— Vg(M)+(l-e-° k l n )iiP k (M) 



< exp 



Proof. Recall that pP k = po + —v. By Lemmas 



7.5 



and 



log 



dQt/dP;(g) 



< 



dQl/dP^O) 
akZg(M) 



< — sup sup 

n A 0<i<k 



for any g £ G* , 



9i) 



Thus, 



^(9) 

dPt yy> 



dQt/dP*(g) 
dQ'JdPi(0) 



{ dQl/dP*(h) , px(h \ 
lg dQ*/dp;(o) ar k v L ) 



< 



e 



akT,g(M)/n 



Jg* 
= exp 

The lower bound is similar 



e -akT,h(M)/n dP*(h) 

k 



a-Xg(M) 
n 



l>{( I 



pPk(M)} 



a 
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For the next result, we recall Campbell's Theorem |DVJ88[ (6.4.11)], which 
says that 



F(g)g[f]dU^g)= / IL n [F(- + 8 x )]f(x) dn(x) 

G JM 

for a finite measure 7r on M. In particular, 

/ g[h] exp{-cg(M)} <SL K {g) = exp{— (1 - e~ c )n(M)} / exp{-c}h(x) dix(x) 
Jg J m 

for any constant c > 0. 

Corollary 7.10. Suppose that Pq = H po . For < j < k < oo and /iP k -almost 
every x £ Ai, 



exp 



< exp{ (e**/" - e- ak / n )^P k (M) + afc/n}. 



Proof. By the first inequality of Lemma |7.9[ we have for a Borel function 
h: M —> R+ that 



Jg* 

= J ^ j g[h]^{g)dP^g) 

> * jg [h]exp{-^g(M) - {e° kln - l)^P k {M)} dP* k {g) 
^{-(e^ n ~l)^P k (M)} 

[^ j9 [h]exp{-^ jg (M)}eKp{-^J2%9(M)}dP^g) 
,{-(e"*/»_i)^( M )J 

<exp{-(l- e - fe /")/x%P fe *(A4)}y exp{-^}/^)d/^'P fc *(z) 
< exp{-(l - e-^/") ^^.p*(M)} 

,|_( e «r*/« _ e- ak ' n )^P k {M) - ak/nj^jP^h}. 



exp< 



x 



expi 



The first inequality of the result follows. The proof of the second inequality is 
similar. □ 

The next result is an elementary consequence of Corollary |7.10| 
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Corollary 7.11. Suppose that Pq = LT po . For < k < oo and /iPk-almost 
every x S M, 



|_( e ^/n _ e - ak/n )^P k {M) - ok/n} 



exp 

< exp{ {f k / n - e- ak / n )fiP k (M) + crfc/n}. 

7.3. Palm measures 

For various probability measures Q on Q, we will need to understand something 
about the probability measure that corresponds to conditioning a pick from Q to 
place a point at the locus x € M. When the intensity measure /iQ is finite, the 
object that makes rigorous sense of such conditioning is the corresponding Palm 
measure Vq^ . For a Borel set A C Q, Vq\A) is the Radon-Nikodym derivative 

where /i^Q is the finite measure on .M defined by 



Ja 



(see, for example, [Kal84 (1.7)]). The Palm measure is defined for /iQ-almost every 
x E A4. If Q is the distribution 11^ of a Poisson random random measure with finite 
intensity measure tt, then Vq\f) = Q[F(- + S X )} for Tr-almost every i£M (see, for 
example, |DVJ 88. Example 12.1(b)]). It follows from the definition of the Palm 
measure that if F : Q — ► M. + and / : A4 — > R + are Borel functions, then 



F(g)g[f]dQ(g)= [ V ( Q x) [F}f(x) dpQ(x). 
Jm 



Note that Campbell's theorem follows from applying this observation when Q is 
the distribution of a Poisson random measure. 

Corollary 7.12. Suppose that Pq = iI Po . For every positive integer k, fiQk- 
almost every x £ M (equivalently, p,Pj~-almost every x 6 M. or (po + v) -almost 
every x € M), and every Borel function F : Q —> K + , 

V%l\F] < exp |^ + {e° k ' n + i _ 2 e - CTfc /">P fe (X)| 
x J F(g + 5 x )exp!^g(M)} dP k (g), 



Proof. By Corollary 7.11 



(7.7) - ex p{( eCTfc/ " " e-° k / n )fiP k (M) + ak/n] 

for (po + f)-almost every x. 

Consider Borel functions F : Q — ► M + and / : M. — > R + . From the second 
inequality of Lemma 7.9 the inequality (7.7), and the characterization of the Palm 
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measure of a Poisson random measure we have 



o!) 



I M 



V ( £[F]f(x)d^Q k (x) 



= [ F(g)g[f]dQ k (g) 
Jg 



F(g)g[f]^(g)dP k (g) 

F(9)^(g)dvS(g))ji.r) l i ll P lA r ] 



< j M (J F(g + 6 X ) exp j + 4) (M) + (l - e^ k / n )^P k (M) \ dP k (g) 

x f(x)dfiP k (x) 
ak 

exp 



.vi 



(l-e- fffc /")//P fe (M)j 



< exp 



n9 + 4)exp|^(.g)(X)| dP fe (g)) f( x )^^( x )dvQ k {x) 
^ + (1 - e- CTfc /") M P fc (M)| exp{(e CTfe /" - e - CTfc /'>P fc (.M) + ^} 



.vi 



P(.g + 4) exp \ —(g)(M) } dP k (g) ) f(x) d[iQ k (x) 



exp 



x 



F( 5 + <y exp<j — (<?)(7W) [> dP k (g) ) f(x)dnQ k (x), 



iM \Jg 
and the result follows. 



□ 



7.4. Comparisons with complete Poissonization 



Recall from Notation 6.1 that the complete Poissonization operator *|3 acts on 
a probability measure P on Q by ^jiP := II^p. Also, recall from Notation 6.2 
that Q' fc := {ty& n yfl n ) k Po is the an alogu e of the sequence of probability measures 
Q k := (9 c lS„2Jl„)' c Po from Notation 4.4 that is of primary interest to us in Thco- 



rcm |5.4[ with the recombination operator replaced by the complete Poissonization 
operator *p. Finally, recall from Notation |7.7| that the counterpart of Q k for Q* 
(that is, for the setting in which we keep track of the generation in which mutations 
from the ancestral wild type occurred) is Q* k = (5H*6* Wl*) k P . 

The next result is a consequence of Corollary |7.10| and the observation that 
if tt' and it" arc two finite measures on M and it' is absolutely continuous with 
respect to it" , then 



dn^ (5) = exp ^ 



log 



dTT 1 

drr 7 ' 



tt'(M) + tt"(M) 



(ill 
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Corollary 7.13. Suppose that Pq = Tl Po . For P£ -almost every g £ Q* , 
exp{(-(e CTfe /" - e-° k / n )vP k (M) - ak/nj Eg(M) - nQk(M) + (iP k (M)} 

~ dP* w 

< exp{ ((e fffe /" - e-° k / n )[iP k (M) + vk/nj Eg(M) - nQk(M) + fxP k (M)}. 

Lemma 7.14. Suppose that Q — 11^ for some finite measure n on M.. The 
probability measure ty&nQ on Q is absolutely continuous with respect to Q with 
Radon- Nikodym derivative 

(7.8) WM {9 )=e-*° W/n , 
where 

[ h(M)e~~ sl > h ^ n dQ(h) C 

(7.9) S' Q (g) := g[s Q ] - ^ j ^sw/njQ^) + n J g h ^ d ^ 

with 

s Q (x) : = -n log J e - s (' l )/« dV ( Q } (h) + nlog J e - s ^' n dQ(h) 

(7.10) = -nlog f e- S{h+s * )/n dQ(h) + nlog [ e - s{h)/n dQ(h) 



' nl0g ( dn {X) 



Proof. Consider (7.101. The first equality is simply the characterization of 
the Palm distribution for Poisson random measures that we have already noted. 
For the second equality take a Borel function <fi : Ai — ► M+ , and define <f> : Q — > M + 
by $(<?) = <?[/]• Then, 



= & n Qm 



Q[e~ s / n ] 
v (*)\ e -s/ n] 

4>{x) dir(x) 



by Campbell's Theorem, and the equality follows. 
Next consider ( fflj) . By ( |7.10| , 

-(5) ~ 



dQ yj/ J e -' l [ s «/™] dQ(h)' 
Now, 
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and, by another application of Campbell's Theorem, 



r s Q (x)/n _ {j = 



■ dir(x) 



,-k(M) 



Je- s ^ n h(M)dQ(h) fh(M)dQ(h) 
f e-W» dQ(h) 



Putting these equations together proves (|7.8|). 



□ 



The analogue of Q k with the complete Poissonization operator ^* replacing 
the recombination operator 91* (equivalently, the counterpart of Q' k for Q*) is 

Q'C := (r e* n M* n ) k P . 

As expected, we have the intertwining relation SQ fc * = Q' k . 

Recall from Definition |2.3| that if tt is a finite measure on M. and x£M, then 
F n (x) := E[S(X n + S x ) — 5(X lr )], where, as usual, X* is Poisson random measure 
on M. with intensity measure 7r (that is, has distribution 



PROOF. Observe that 



\sq(x) - F„(x)\ < n 



log 
log 



dQ(h) 



S{h + 5 x )/ndQ{h) 



,s(h)/n d Q( h j_ I S (h)/ndQ(h) 
and the result follows from Lemma [A. II 



□ 



Recall from Lemma 



7.4 



that dd\A& n Q / dS^AQ — ex p{ — Sq^/u], where Q 
is any probability measure on Q, A is a partition of M into a finite number of 
Borel s ets, is the annealed recombination operator of (7.2 1, Sq : a is defined in 
Lemma 



7.4 



In particular, Sq.a depends implicitly on n. 



Corollary 7.15. Consider a finite Borel partition A o/9Jt. Suppose that Q 
a probability measure on Q under which the masses of sets in the partition A are 
independent. Let Q be defined to have dQ/dQ = cxp{— Sq,a/ii\. Then, 



d^Q 
d%Q 



-S' a /n 



Proof. By assumption, Q = 9l^Q. By Lemma 7.4 we may write 

Q = K A 6 n Q 

Thus, 



d^Q = dSp&nQ 
d^Q ~ d^Q 



-S'^/n 
e Q 



by Lemma |7.14| 



□ 



Lemma 7.16. There is a constant n depending on a := sup gg g S(g) but not on 
n such that for a probability measure Q on Q, a finite Borel partition A of M., and 
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a point g € Q , 

s Q,a(9) ~ S' Q (g) 



<«( + -Q[X(M)] 

1 n n 



Q{x(A) > o} 2 + ]T Q\ x ( A )Hx(A)>2 } ] 



AeA 



AeA 



Q{X(A(x)) > 0} dg(x) + I V { q ] {X{A{x)) > 2} dg(x) 



M JM 

+ g L e M : Q{X(A(x)) > 0} > U + #{A e ^ : > 2}) , 

where A(x) C M is the unique set in the partition A that contains the point x G M.. 

PROOF. We begin by establishing a bound on the difference between SQ^(g) 
and the quantity 

[Q[S\X A = g A ] -Q[S\X A = 0])-T Q , 

AeA 

where 

T Q := Q[S(X)X(M)] - Q[S(X)] - Q[X(M)}. 
By definition (see Lemma [7. 4[ ), 

Sq,a{9) 



E ( lo sQ 



AeA 



-S{X)/n 



Xa = gA 



(logQ \e- s W n I = 



AeA 



logQ 
-logQ 



e -sW/«|X(A) = o]) 



-S(X)/n 



In the first sum, a summand can only be nonzero when g(A) > 0. By Lemma A.l 
the first term is is thus approximately 

E (Q W I = 9a] - Q [S(X) | X(A) = 0] ) , 

Ae.4:g(A)>0 

with an error bounded by 



la 2 



2n 2^ 6 ' T/ " 3( - M) 



9{M). 



The second term requires more care, because the sum extends over all (the 
potentially very large number of) sets of the partition A. Fix one of these sets A. 
Then. 



logQ 



(7.11) 



-S(X)/n I X (jy = o 



io g g 



-S(X)/n 



log 



Q le-sm/"l {x{A)= o } ] 
< [ e s(x)M Q{X(A) = 0}' 



Note for y > 1 that 
(7.12) 0<(y-l)-lo g2 / 



y 1 / I 

l--<fa< fl -- ] ('/ -1) 
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and a similar argument shows that the same bound holds for < y < 1. That is, we 
have the approximation log(y) f=a y— 1, y > 0, with error bounds y — 1 — (y — l) 2 /y < 
log(y) < y- 1. 

Applying this approximation to the right-hand side of (7.111, the first-order 
term is 



Q [e- s ^/"l {W0} ] 
Q[e-sm/n}Q {X (A)=0} 



1 



(7.13) 



Cov Q [e- s W n ,l {x(A)=0} ] 

Q[ e -S(X)/„]Q{ X(A) = } 

Cov Q [l~e- s W/",l {x(A)>0} ] 
Q[e-W«]Q{I(i)=0} 



~ n COVQ ^ ( - X ^ 1{X(A)>0} ^ 
The difference between the second and third quantities in ( 7.13[ ) is 

Q [(i _ e s(x)/n _ s{x)/n _ Q [l _ e -S(X)/n „ 5(X )/n]) 1 { X(A)>0}] 
Q[e-W»]Q{Jf(A) = 0} 
(7.14) + Q[(S(X)/n-Q[S(X)/n])l 

1 

Q[e-5(X)/"]Q{X A - 0} 



From the inequalities —y 2 /2 < 1 



(7.141 is bounded by 



- 1 

v — 2/<0, y > 0, the absolute value of 
Q{X(A) > 0}\ 



n Q{X{A) = 0} 



This bound is only useful when Q{X(A) > 0} is small, so we further note that the 
absolute value of ( |7.14 1 is also bounded by 



(7.16) 



Cov Q [l- e - s (*)/",l {x(A)=0} ] 



e- s ( x )/ n \Q{X(A) = 0} 
a a 



\Cov Q [S(X),l {x{A)>0} }\ 



n n 

The contribution coming from the bound (y — l) 2 /y on the error in the first 
order approximation log(y) ~ y — 1 is bounded by 



' Cov Q [e- g W/",l {x(A)=0} ] \ Q[e- g PO/"]Q{X(A) = 0} 
Q[e-s(x)/n]Q{ X (A) = 0} i Q[e^(x)/»i 



{X(A)=0}J 



(7.17) 



Cov Q [l-e- s ( x )/",l {x(A)>0} ] : 



Q[ e -s(x)/n] Q{X(j4) = }Q [e-W»l Wj , w ] 



< e 2CT/ " 



a 2 Q{X{A) > 0} 2 



n 2 Q{X(A) = 0} 2 ' 

Once again, this bound is only useful when Q{X(A) > 0} is small, so we further 
note that the left-hand side of (7.171 is also bounded by 

T 2 



(7.18) 



,2a /r, 



ri 



2 ' 
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Putting these bounds together, we see that 
- n (\ogQ\e- s{x]/n \X{A) = 

~ CoVq[S, 1{X(A)>0}]) 

with error bounded by 



(7.19) 



logQ 



,-S(X)/n 



e a/n — Q{X{A) > 0} 
n 

(7.20) + e a / n aQ{X(A) > 0} (~ + 2Q{X{A) > 0}) 

+ 4e 2r7/n — Q{X(A) > 0} 2 , 
n 

when Q{X(A) = 0} > 1/2, and by 
(7.21) 



e"l n a — I — cr — I — e 2a/ ' 



n 



otherwise. 

Note that 
(7.22) 

T Q -J2Cov Q [S(X),l {x(A)>0} ] 
AeA 



J2Cov Q [S(X),X(A)-l {x(A)>0} ] 



AeA 



<aJ2 Q[X{A)1 {X{A) > 2} ] 



AeA 



Summing (7.191, (7.201, and (7.21 1 over A G A and combining the result with 



(7.22 1, we see that 

(7.23) SqM ~ {Q[ s \ x a = 9a] -Q[S\X A = 0]) -T Q 

AeA 

with an error bounded by 
(7.24) 

2 2 

e a ' n —g(M) + 2e er / n ~Q[X(M)] 
n n 

2e°/ n a + ±e 2aln — + 4 (V /? V + <r + e 2<T/n — \\ Q{ X ( A ) > 0}' 



AeA 



+ aJ2Q[ X i A ) 1 {x { A)> 2} ], 
AeA 

where we used the inequality 



l{Q{X(A) = 0} < 1} < 4Q{X(A) > 0} 2 



We now bound the difference between S'g^g) and the approximation to SQ^ A (g) 
in (7.23). We stress that here we are not assuming that Q is the distribution of a 
Poisson random measure, and so, in the definition of £q(<?) from (7.9 1, namely 

S' Q {9) = g[» Q ] - [ £? s(x)/n] 1 + nQ[X(M)], 
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the definition of the function sq is the one appearing in the first line of (7.101 
namely 



sq(x 



(x) = -n\ogV { n ] [e- s W n ] + nlogQ[e- s W/"], 



rather than the definitions on the other lines of (7.101 that are only equivalent in 



the Poisson case. The absolute value of the difference is at most 



(7.25) 



E \9A[s Q }-{Q[S\X A =g A ] -Q[S\X A = 0])\ 

A<=A:g(A) = l 

+ E \9A(s Q )-{Q[S\X A =g A ]-Q[S\X A = 0])\ 
AeA:g(A)>l 

0\X(M)e~ si - X)/n ] 
- n Jffxj/n] l +nQ[X(M)] + T Q 



We will consider each of the terms in (7.25 1 in turn 



Note first of all that, by Lemma A.l 



logQ[e~ 5(x)/ri ] + Q[S(X) \X A = 0] 
Q[S(X)1 {X{A)=0} ] 



< 



(7.26) 



Q[S(X)} - 



Q{X{A) = 0} 



o a / r ' 



2n 



Q{X(A)>0} a/n ^_ 
~ Q{X(A) = 0} 2n 



< 2aQ{X{A) > 0} + e 



<7/nfl_ 

2n 



if Q{X(A) > 0} < |, and otherwise the left-hand side is bounded by a. 

Note next, that, by definition of the Palm measure, for any Borel subset B C Q 
we have 



(B) -Q(B\X A = 5 X ) < V y o>{X{A) > 2} 



for /iQ-almost every x £ M.. Combining this observation with Lemma A.l gives 



(7.27) 



nlogpgV 5 ^ 07 "] + Q[S(X) | X A = 5 X ] 

2 

< e a/n ^ n + aV { Q ] {X{A) > 2). 
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Integrating the inequalities (7.261 and (7.271 against the measure g A and sum- 
ming over A yields that the first sum in (7.251 is at most 

Y\ [ \a Q {x)-(Q[S\XA = S x ]-Q[S\X A = 0])\dg A (x) 



AeA:g(A) = l 



< ]T f 2aQ{X{A) >0} + e^ n ^- + al[Q{X(A) > 0} > H 



AeA:g(A) = l 



(7.28) 



aP { Q } {X(A)>2}dg A (x) 



2n 



2 

< e a / n —g(M) 



+ / 2aQ{X(A{x)) > 0} + aV { Q } {X(A(x)) > 2} dg{x) 



M 



(7.29) 



+ ag \x e X : Q{X(A(x)) > 0} > 

Turning to the second sum in ( 7.25[ ), we have 

AogV { Q x) [e~ s ^/ n ] + Q[S(X)\X / 



9A 



< a. 



Combining this with (7.261, integrating with respect to g A , and summing over A 
yields that the second sum in (7.25) is at most 

j/r, 



2n 



i ■■■—g(M)+ 2aQ{X(A(x)) >0}dg(x) 



M 



aglxeM: Q{X(A(x)) > 0} > 



(7.30) 



+ a#{AeA:g(A)>2}. 
The third term in ( 7.25[ ) is 
(7.31) 

~ U frU] + n ^ M ^ + Q[X{M)S{X)] - Q[X{M)]Q[S{X)] 



< 



(Q[X{M)e~ s W n ] - Q[X{M)]Q[e- S{x ^ n 

n [Q[X{M)e- s{x ^ n ] - Q[X{M)]Q[e- s{x ^ n 
- (Q[X(M)S(X)] - Q[X(M)]Q[S(X)]) 

:{M) - Q[X(M)})(1 - e -*«/«)] Q ^ sm/n] 
(X(M) - Q[X(M)])(1 - e - s ^/ n - S(x)/n) 



1 



Q[e-W»] 



1 



nQ 
nQ 



< Q[X(M)] (e a/n — 
\ n 



\°1 
2 n 



It follows from substituting the bounds (7.281, (7.30), and (7.31) into (7.25) 
that the difference between Sq(<?) and the approximation to S>Q, A (g) in (7.231 is 
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bounded in absolute value by 



(7.32) 



I AaQ{X(A(x)) > 0} + ap£ } {X (A(x)) > 2} dg(x) 

M 



2ag jx S M : Q{X{A{x)) > 0} > - 
a#{AeA: g(A) > 2} 



,cr/r 



2"n 



g[x(x)]. 



The result follows upon combining this last bound with the bound (7.241 on the 
absolute value of the difference between SQ tJ \(g) and the approximation to SQ^(g) 
in (f7T23l). " " □ 



CHAPTER 8 



Proof of convergence of the discrete-generation 

systems 



With Proposition |6.3| in hand, it will suffice to show for a number of generations 
k with < k < Tn that Q k = (9\20l n & n ) k Po, the probability measure that describes 
a population after k rounds of selection, mutation, and recombination, is close to 
Q'k = ( ( POT n ©„) fe P , the probability measure that describes a population after k 
rounds of selection, mutation, and complete Poissonization. 

By Lemma [2. 7| 



(8.1) 



Q k ~Q' k \\ Was <\\Qk-VQk 
< \\Qk-VQh 



I Was 



I Was 



\\yQk-Q' k 
■4\\(iQk - nQ\ 



Was 



fell Was' 



where we recall that [iQk and [iQ' k are the intensity measures of Q k and Q' k . 
Set 



a-k ■= \\Qk - VQk 



Was 



and 



b k := llfiQk - nQl 



fell Was' 



Given a Borel function / : 9Jt — ► [— 1, 1], define the function F : Q — + K by F(g) := 
g[f]. Note that 



Qk+l[F] = flQ k+ l[f] = K^n&n)Qk[f] 



"[/] 



6nQk[f], 



where we use the facts that, for any probability measure Q on Q, fiUlQ = fiQ and 
[iWlnQ = mQ + v l n - Thus, from the inequalities — y 2 /2 < 1 — e~ y — y < 0, y > 0, 



•2) 



bk+i <sup{|6„Q fc [/]-S n Q' fe [/]| : ||/|| Lip < 1} 



Qk fe 



-S/r, 



Q'k 



-S/r, 



< e 



<r/n 



\Qk[f}-Q'k\f}\ + -\Qk [S-f]-Q' k [S-f}\ 



, i 2a 2a 2 

< g { b k + —a k + — 
n n z 



2^2 

n 2 



tin 
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We now bound a k . It follows from the relations £Q£ = Q k , = and 



SP t * = P fe that 



Qfc-5PQ* 



| Was - sup{|Q fc [0] - ?PQjt[0]| : |H| Li p < 1} 
<Bup{|Q fc [0]-(PQ fc M| : Halloo < 1} 

= Bup{|Q3E[0oE]-qrQjE[^o5]]| : ii^u < i} 
<sup{|Q3E[0*]-qrOfc[0*]| = II^IU < i} 



dPt yy ' dR 



dP* k {g) 



So, from the inequality logy < y — 1, 



I Was 



< 



log 



dp* yy > & dp* 



(g) 



dPt 



dPt 



By Lemma [778 



(gk,---,go) 



= J exp j-i^S QiiA ( ffi + ■■■ + . 9o ) | dft(A 1 )---dft(A fe ), 

where the partition _4j in the integral is {Aj+i, .<4| +1 } A • • • A {A k , A^}, and, by 
Lemma 17.91 



dQ* k 



dP* 



ak 



±(g) < exp — Eg(M) + (l - e - CTfc /") M P fe (.M) 



By repeated application of Corollary |7.15| 



dPr iyfe '- 



fc-i 



.go) = exp \ -^-^2S' Q .{g 



+ 9i) 



and, by Corollary |7.13[ 

-(g) 



<m*Q* k 



dP* 



< exp { ((e CTfe /" - e- ak ' n )^P k {M) + afc/n) Zg(M) - fiQ k (M) + fiP k (M)} . 
Observe that 
^Xg(-M) + (1 - e-° k ' n )nP k {M) } 

V { ((e fffc /" - e- CTfc /")MPfc(M) + <7*/n) £<?(M) - ^Qfc(M) + ^Pfc(-M)} 

< ((e fffc/n - e-^ n )fiP k (M) + afc/n) Eg(M) + /iP fc (M) 

< (( e CTfc /« - e - ak ' n ) (po(M) + v{M)k/n) + ak/n^j Eg(M) 

+ (p (M) + v{M)k/n) 

< cEg(M) + c, 
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where c and c are constants that depend on a, T, po(Ai), and v(M). Therefore, 

\\Qk- VQk\ 



I Was 
fe-1 



■3) 



< 



< 



J i=0 



fe-1 



TJ ^ ' 



Qi^CsoH hffi) 



i=0 



exp{dY,g(M) + c} • • • dK{A k ) 



fe-i 



i=0 



exp{cg(M) + c} dP l {g) d1l k - t (A), 



where c is a constant that depends on a, T, po(Ai), and v(A4) such that 

J exp{5( 3fc _. i+1 + • • • + g k ){M) + c} dU^-\g k , g k - i+1 ) < exp{c}, 

and we recall from Section 7.1 that lZ k -i is the probability measure on parti- 
tions of M that is the push-forward of the product measure 7?®( fc-1 ) by the map 
(A i+1 , ...,A k )^ {Ai +1 ,A c i+ i} A ■ ■ • A {A k ,A%}. 



Recall from Chapter 
probability measure Q on' 



7.3 
V 



(x) 

that Vq denotes the Palm measure at x £ M of a 
'Recall also for a finite measure 7r on Ai and bounded 
functions f : M —> M + and h : M — > K+ that 



1]} 



(8.4) 



J exp{g[f}} dU n (g) = exp{ir 

J g[h}exp{g[f]}dll^g) = tt[^] exp{7r[e^ - 1]} 
From Lemma |7.16[ 

S'qM ~ S QlA (g)\exp{~cg(M) + c} dP^g) 

< f J-g(M) + -Q t [X(M)} 
I \n n 



(8.5) 



Qi{X{A) > o} 2 + Qi[X(A)l {x{A) > 2} ] 



AeA 



AeA 



+ j Qi{X(A{x)) > 0}dg(x)+ j V%{X{A{x)) > 2} dg{x) 



M 



M 



+ g U G M : Qi{X{A{x)) > 0} > - j + #{A e A : > 2}J , 
x exp{c#(.M) + c} dP l (g), 

for any Borel partition A of A4, where A(x) is the set in A that contains x. 

We next break ( 8.5 1 up into a sum of integrals and, ignoring multiplicative con- 
stants that depend on a, T, po(Ai) and v(M), estimate each integral individually. 
We will apply (8.4) repeatedly without explicit mention. We will write ci, C2, ■ • ■ for 
constants that depend on a, T, p (M) and is(M), and sometimes use expectation 
rather than integral notation. 

To begin, 

-P l [X(M)exp{cX(M)}} < -. 
n n 
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From Lemma 17.9 



-Q,lX(M)]P,lexp{cX(M)}] 



< l -P t 
n 



oxp <; — X(M) + (1 - e-^nPiiM) 



Pi[exp{cX(M)}] 



< 



Again from Lemma 7.9 



i{X(A)>0} 
< Qi[X{A)} 



< Pi 



X(A) exp ^~X(M) + (1 - e- ai / n )nPi(M)\ 



< c 3t iPi(A), 



and so 



Q t {X(A) > 0} 2 P t [exp{dX(M)}} <c±J2 (V P i( A )f 



AeA 



AeA 



Similarly, 



i[X(A)l{ X (A)>2}\ 
< P, 



= P 

= exp 
-Pi 



X(A)1 {X{A) > 2} exp | ~X(M) + (1 - e-^/iPiiM) 

X(A)(1 - l {X (A)<i } exp \^X{M) + (1 - e-^ n )^Pi{M) 
{(l- e -W«) M p.(^)| 



X(A)exp\ —X(M) 



exp\ —X(M\A) 



Pi{X(A) = 1} 



< c 5 LiPi(A) exp 1 — } exp <^ fiP^M) 



- 1 



- exp ^iPi{M \ A) I — - 1 ) \ nPi(A) exp{-(xPi(A)} 
< ceitiP^A)) 2 , 



and so 



J2 Q l [X{A)l {x{A) > 2} ]P t [exp{cX{M)}] < c 7 ^( M P 4 (yl)) 5 



AeA 



AeA 
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Also, 



Pi 



M 
= Pi 

= Pi 



Qi{X(A(x)) > 0}dX(x)exp{cX(M)} 

I Q t {X(A)>0}l {xeA} dX(x)exp{£X(M)} 
a&a Jm 

J2 Q l {X{A) > 0}X(A)exp{£X(M)} 



LAeA 



AeA 



by arguments along the lines of those above. 
By Corollary [7l2l 



/ V { q ] {X{A{x)) > 2} dX{x) exp{~cX{M)} 
Jm 



< exp 



+ (g^fc/n + i _ 2e- ak ' n )^P l {M) 



y n 



x P, 



(J 1 {(g+6*)(A(x))>2} exp l^-g(M)^ dPi(g)J dX(x) 



I M 

x exp{cX(M)} 



AeA 
x exp{cX(M)} 



1{ 9 (A)>1} exp i — g(M) > dPi(g) l{ xeA } dX(x) 

M \.l I n 



l{x(A)>i} exp \ — X(M) 



ak 



P l [X{A)exp{cX{M)}] 



= c 9 J2p 

AeA 

<c w J2(VP(A)) 2 , 

AeA 

via arguments we have used before. 
Observe that 



g{xeM: Qi{X(A(x)) > 0} > - J. < 2 J Qi{X{A{x)) > 0} dg(x), 



and so 



Pi 



X{ x e M : Qi{X(A(x)) > 0} > - } exp{cX(M)} 



<cn ^2(f^Pi(A))' 
AeA 



from above. 
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Lastly, 

J2 P * [Mx(A)>2} ex V {cX(M)}] 



AeA 



= Y. P - [Mx(A)>2 } exp{cX(A)}] P t [exp{dX(M \ A)}} 
AeA 

<c 12 ^(^(A)) 2 . 



AeA 



Substituting these estimates into (8.51 and recalling (8.3) gives 

\\Qk-VQ k \\ Was 

i=o V J AeA ) 

where c and c are constants that depend on a, T, po(M.) and v{M). 
It follows from Corollary |7.3| that 



*-llo.-w.||».^E ir ^n^!5=, 

where 0' and (3" are constants that depend on a, T, po(Ai), and the con- 

stants that appear in the shattering condition for the measures po and v with 
respect to the recombination operation M. 

It is clear that a k tends to as n — > oo. Moreover, from (8.2), 

. ■- \ logro 



\ n n n z J 



for a suitable constant 7 that does not depend on n, so that b k also tends to as 
n — > 00, completing the proof. 



CHAPTER 9 



Convergence to Poisson of iterated recombination 



Our main result, Theorem |5.4| states that when n is large the discrete gen- 
eration mutation-selection process, when started in a probability measure on the 
genotype space Q that is the distribution of Poisson random measure on the lo- 
cus space Ai, stays close to the set of such probability measures out to numbers 
of generations that are of order n. In particular, this conclusion holds when the 
discrete generation starts with the entire population being completely wild type at 
every locus, so that the initial condition is the distribution of the Poisson random 
measure with zero intensity. 

In this section we provide a complement to this observation by showing that 
the recombination process acting alone rapidly shuffles the distribution of a non- 
Poisson random measure on Ai to produce a probability measure on Q that is close 
to the distribution of a Poisson random measure on Ai. 

Definition 9.1. A distribution P on Q is dispersive if there is a constant j3 
such that for any Borel A C Ai, 

g{A)\ {g{A) > 2} dP{g) < P^P(A) 2 . 

Of course, the distribution of a Poisson random measure is always dispersive. 

PROPOSITION 9.2. For all n and k, ii9\ k P = [iP . Ij 'the recombination measure 
is shattering with respect to /j,P, and P is dispersive, then 5R fc P converges to tyP, 
as k — > oo, with 

(9.1) \\m k P - <pP|| Was < (3/3 + 2) (v{M) 2 V 2av{M)) (k + l)" 1 . 

Proof. We note first that 

liK k+1 P{B) = J JJ^g 1 \ R +g 2 \ Rc )(B)d f im k P(g 1 )d^m k P(g 2 )dTZ(R) = fx n (B). 

Let A — (Ai, . . . ,An) be a partition of M.. Then, D\aP defines a random 
genotype that has independent components on each A i7 with corresponding distri- 
butions P\a ■ The distance between two probabilities on genotypes is the sum of 
the distances between the restrictions to a partition of Ai. Thus, 

JV 

HrR fc . Pi — TT „, , 

\Ai 1 1 Was 



|^p-n MF || Was = YJ*A p \ a~^\ 



(9-2) 



<EII^L-n 

i=l 



P P \Ai II Was' 
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Pick any Borel subset A of X. Define \i A to be the measure on A defined for Borel 
subsets B C A by 



li A {B) = f g(B)l {g{A)=1} dP(g). 
Jg 



Clearly /j,a < \xP- Also define the sub-probability measure Pa on genotypes re- 
stricted to A by 

PaW = / f(g\ A )l {g(A )<i } dP(g). 
Jg 

Then, [ia is the first-moment measure of Pa- Note also that 

(9-3) \iP{A) = ha{A) + f g(A)\ {g{A) >2}. 

Jg 



We have 

\\ p \, 

(9.4) 

We bound first 
(9.5) 



"^H A II Was 
^ I^L^^IIwas 



Pa-H 



1 1 Was ' 



P \a ~ P A\w as ^ P{9 ■■ 9(A) > 2} < faP(Af 



Next, for any / : Q ->• K with ||/|| L i P < 1, 
|Pa/-H Ma /| < |P{ 3 :<?(A) = 0}-n AIA {.g:.g(A) = 0}| 

+ |-PA/l{g: ff (A) = l} - n AIA /l {g:9(A) = 1} | + n MA { ff : fll(A) > 2} 

1 - fi A (A) + P{g : g(A) > 2} - e -^( A ) 

+ y A {f) (l-e-^)| + ^ 
M P(A) 2 



< 



< P{<? : g(A) > 2} 



M^) 2 + ^4r^- 



Thus, 
(9.6) 



P4-n 



^ 1 1 Was 



< (/3 + 2) Ai P(A) 2 



Finally, by Lemma 2.7 



A II Was 



+ \» A {A) -»P(A)\ + -^ A {A) 2 + l iP{Af) 



For any / : A — ► K with ||/||Lip < 1> since is the first- moment measure of Pa, 
we have by Campbell's Theorem [DVJ88I (6.4.11)], 



\HAf - fJ-Pf\ = 



g[f]dP A (g) - f g[f]dP(g) 
i Jg 

9[f]l{g(A)>2}dP(g). 



Thus, 
(9.7) 



\U, A -U, P \ A \\ Was <P^P(A)' 



Putting (9.5), (9.61 and (9.7 1 into (9.4 1, we get 



(9.8) 



l P L- n Mp|Allwas<( 3 / 3 + 2 )^) 2 
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By ( p^2| , then, 

(9-9) ||$R fc P-IV|| Was < (3/3 + 2)E|rr| 2 , 

where the expectation is take n wi th respect to partitions 7r chosen from the distri 
bution SH fe . Applying Lemma 



7.2 



completes the proof of (9.1 1. 



□ 



Note that Proposition |9.2| is essentially a point-process version of Le Cam's 
Poisson convergence result of [LC60 . 



APPENDIX A 



An expectation approximation 



The following lemma, which gives error bounds for this approximation, was 
used several times in this work. 

Lemma A.l. Let T be a nonnegative random variable with finite second mo- 
ment. Then, 

—E[T] + l - Var(T)e E[T] > logE [e- T ] > —E[T]. 

and so 

< logE [e- T ] > +E[T] < * Var(T)e E[T] . 
In particular, if T is bounded by a constant r , then 

< logE [e- T ] > +E[T] < \-r 2 e T . 

PROOF. The second inequality holds directly from Jensen's inequality, by the 
convexity of x t— > e~ x . 

2 

The function x e~ x — ^- is concave for x > 0, so that by Jensen's inequality, 

T 2 1 . _ E[T] E[T] 2 
2 



E 



< e 



Consequently, 
E [e 



< e 



-E[T] 



+ l - Var(T) = e- E ^ (l+l Var[T]e E I T ^ . 



Taking logarithms of both sides, and using the bound log(l + x) < x completes the 
proof. □ 
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